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Seq. ID. Mo 7 



1 GGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACr- 
CCTGGGCCGCCGAAAGGCGCGCGACCACCGGGTCACGGACCACACGCACGGGACCCTGCG 



NFkB^CSl 
GGGRQTYYQC 
MFkB-MHC-I . 2 
TGGGCTTCCCC 



1 ACGGCCGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGGGCCTCCCCGGGGTCGGCGTCCG 
TGCCGGCGGGGGGCGGCGGGGGAGGAAGGCGGTCCACCCGGAGGGGCCCCAGCCGCAGGC 

Intronl 

1 GCTGGGGTTGAGGGCGGGCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACT^ 
CGACCCCAACTCCCGCCGGCCCCCCTTGGTCGCTGTACGCCTCTCGTCGCGTCCGCTGAG 

NFkB„CSl 
GGGRQTYYQC 
NFkB_C32 
RGGGRMTYYCC 

Topo_r I_c i eavage_s i t e 
RI^millCITNGYNGKTOIYNY 

1 AGGGCGCTTCCCCCGCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGG 
TCCCGCGAAGGGGGCGTCCACAGGACGGACTTCCTCGACCACCGGGCTCACGACGTCTCC 
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! .^AAACCCCAA AACCCCW.A CCC ZTTT T AG AGCCCTGCaG TTGGA.. 
J I AACCTCAGTA TTAaTAAGCT CAGATTTrAA ATATT.AATTA C.AAAACC . 
01 ATGGAGGTTG ATCTTGATAA TCAAGCTGAT AATCATGGCA TTCACTCAGC 
5 1 _^Ci iAAGACT iCTGAAGA.A,A TT.A.AAGAAGC TAAA.ACGTTG TACTCTTGGA 
0 1 1 CCAGAAAGT _iATTAGATGA aG AAA TCAAT CTCAA.AGTCA TTaTAAAGAT 
251 TTAGA.AGATA i i AAA.ATATT TGCGCAGACA A.ATATTGTTG CTACTCCACG 
' ^ ' AGACTATAAT GA.AGAAGATT TTAA AGTTAT TGC.AAGAAAA GAAGTATTTr 
C.AACTGGACT AATGATCGA.A CTTATTGACA AATGCTTAGT TGAACTTCTT 
TCATCA.AGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTCA 
^ 5 I ACTTAAGGGA AATCA.ATTAG CAAAGACCCA TTTATTAACA GCTCTrTCA.A 
CTCAAAAGCA GTATTTCTTT C.AAG ACGAAT GGAACCAAGT TAGAGCAATG 
ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCC A 
0 1 GCGAACTTCT GAAGG AACTC TTGTTCAA TT TTGCGGGAAT AACGTrTTTG 

0 5 i ATCATTTGAA AGTCAACGAT AAGTTTGACA AAAAGC.A.A-A.A AGGTGGAGC A 
■■ 0 1 GCAGACATGA aTGAACCTCG ATGTTGaTCA ACCTGCAA.AT ACA.ATGTC ^ 

5 1 GA.ATGAGAAA.GATCACT[TC TCAACAACAT CAACGTGCCG AATTGGAATA 
301 ATATGAAATC A.AGAACCAGA ATaTT-TTATT GGACTCATTT TAATAGA.AAT 

1 5 1 .A.ACC.\ATTCT TC.AAAAAGCA TGAGTTTGTG AGTAACAAAA ACA-ATATTTC 
=0 1 aGCGA i GGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA 
9i I GAATTAGAAA GAAGGT.A.AA.A GATA_aGGTTA TCGAAA.AAAT TGCCTaCATC 

CTTGAG AAAG TG AA.AG ATTT TA ACTTGAAC TACTATTTAA C.AAAATCTTG 
TCCTCTTCCA GAA.AATTGGC GGGA-ACCGAA ACAAAAAATC GAA^ACTTG A 
TAAATAA.AAC i AGAGa.AG.AA AAGTCGAAGT ACTATGAAGA GCTGTTTAGC 
TACACAACTG ATAATAAATG CGTCACACAA TTTATTAATG AA 11 U I CTA 
I 20 ! C.AATATAGTG_CCCAA.AGAGT TTTTGACTGG AAGAAACGGT AAGAATTTTC 
I 2i 1 A.AAAGAA.AG i j .AAGAAATAT GTGCAACTAA ACAAGCATGA ACTCATTCAC 
! 30 I AAA.AACTTAT i GC I I GAGAA GATCAATaCA AGAGAA.ATAT GATGGaTGCa 
13 31 GGTTGAGACG TCTGGAAAGC ATTTTTaTTA TriTGATCAC GAAAACaTG' 

. 40 I acctcttatg ga aattg ctg ggatggatat tcgaggatct cgtcgtctgg 
. 4> I C I gattagat gatttttcta tgtcagggag caacagaaaa gttactccaa 
. ;0 I ^a_cctattac tacagaaaga aTaTTTGGCA CGTCATTATG AAAATGTCAA 
: 5 5 i I cgcagactt aaagaaggaa acgg-ttgctg AGGTCCAAGA AAAAGaGGTT 
; 60 1 GAAGAATGGA AAAAGTCGCT TGGATTTGCA CCTGGAAAAC tcagactaat 
. 05 1 ACCGAAGAAA ACTACTTTCG GTCCAATTAT GACTTTCAAT AAGAAGATTG 
I 70 I TAAATTCAGA CGGGAAGAGT ACAAAATTAA CTACAAATAC GAAGTTaTTG 
: 75 1 AACT CTCACT TAATGCTTAA GACATTGaaG aatagaatgt ttaaagatcc 
1301 TTTTGGATTC GCTGTTTTTA ACTATGaTGA TGTAATGAAA AaGTATCaGG 

1 85 I AGTTTCTTTG CAAATGGAAG GAAGTTGGaC AACCA AAACT CTTCTTTGG A 

1 90 1 ACTATGGATA TGGAAAAGTG ATATGATAGT GTAAAGAGAG AAAA.ACTATC 

1951 AACATTCCTA AAAACTACTA AATTAGTTTC TTCAGATTTC TCGATTaTGA 

2001 CTGCACAAATTCTAAAGAGA AAGAATAACA TAGTTATCCA TTCGAAAAAC 

205 1 TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCGAGAAGAT 

2101 7GCACTTGAA GGAGGaCaaT ATGCAACCTT ATTCAGTGTT CTTGAAA.A TC 

2 i 5 1 AACAAAATGA C I 1 AAATGCa AAGAAAACaT TaaTTGTTGA AGGAaaGCAA 
20 1 AGAAATTATT TTAAGAAAGA TA ACTTaCTT CAACCAGTCA TTAaVaTTTG 
25 I GCAATATAAT TACaTTaaCT TTaa TGGGA A GTTTTaTAAA CA.AACaa_AAG 
301 GAATTCCTCA aGGTCTTTGA GTTrCATGAA TTTTGTGaTC ATTTTaTTaT 
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"'I p^iil^r-JT'^S^CC'^^'^GCTC CTTAGGATTC CTTAGAGATG AATCA.ATGA. 
^" ^/r^^A^.'^'"^^^ ATCTTCTAAT GAGACTTAGA gTtGACTAT^ 

yr^c- . AC aactc^gag aataatgcag tattgtttat tgagaXacS 
Sl^?p£:±\^^^^"^^^A TCGA-rn-AAA rrcA.ATATGA ag^c-57a 

GTCT^G CCA-n-AAGTC CAACCAA.ArrTGC.AAA.^TAC GG^^GGAT. 

?c2ll?P?r. ^^'^^'^^-■^ gttcaagatt actgcgattg gatog?^?c 

Ir^V^ ^ TGAAA^^CTCT TGCTTTAATG CCA,A.ATA7TA ACTTGAr a 7^ 
C^r-A'^^'^.''''^ TCAATCTAAA CATGC^^GA^.'J^^I^ 

^t^^f;£:i"/^^'^'^GAAGAAA CTAAAGTCGTTTTTAATGAA T^AC^ACr 
C^C '^^ GAAAGACGAT TACAACCGAA GACTTTGCGA AT^S^tS 
?^?r TTTATATCaG GCGGTTACAA ATACATGCAA TCA^fl7r 

CCrrGTCTGGA.^™^^ 

"•^irrr. — -)!'''-^-^''^'^^ AAGCACTTTA TTGAA.ATA-rr CaScTcTA 
, r , p'^SJ"'^-- ^ ' L ' C-^CAGAGT TTCCATGATC CTCAAGGC A AXc^ACrl . 

-il luGGu. M GGGTrn-GGC GTrrrGGGG ^-i-.^LnC 
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:VrEVpVDNQAD NHGIHSALKT CHHIfCEAKTL YSVVTQKVfRC RNQSOSF-v^cn 
'^EDL<JFAQT NtVATPRD^fN EEDFKVIARJC EVFSTGLMIE LIDKCLVELL 
^!5^yfP^^^ LQCFGFOLKG NQ LAiCTHLLT ALSTQKQYFF QDEWNQVRam 
''^'S'f ^ * -nCYLIFQRTS EGTLVQFCGN NVFDtiKVND KFDFCKQKG^ 
ADNC^PRCCS TCKYNVKNEK DHFLN-NrNVP NWNNMKSRTO IFYCTKFNRN 
^£J,^"^-^ SNKNNISAMD R^AQTIFTNIF RiT^RIRKKLK OKVIEialm 
v™S^^ YYLTKSCPLP ENWERKQKI ENLINKTREE KSKYYEr^^s 
r'TTDNKCVTQ FINEFF'-NTL PKDFLTGRNR mFQKXVKKY VELNfct^Tm 
f^^^E^ RHISWMQVET SAKHFYYFDH ENF^/LVKlI R^^ws 
^^^^JT'^ QQKSYSrCTYY YRJGVIV/DVIM ICMSIADLKKE TLAEVOEK^v 
EEWKXSLGFA PGKLRLIPKK TTr RPIMTFN KKIVNSDRXT^^r^ff 

NRMF?CJDPFGr A VFNYDDVMK KYEEFVCKWK OVGOpipFA 
ITdDIEKCYDS VNREKLSTFL KTTKLLSSDF WIMTAQILKR KNh^VfDS^^ 
FRXKZMKDYF RQKFQPQALE GGOYPTT.FSV LEKEQNDLNA kSxi^?^ 
RNYFKpNLL OPVrNICQ':-i YINPNCKFYK QTKGIPQGLC V^S^^"^ 
AFLrciSLCF LRDES.VfNPEN PNVNLLMRLT DDYLLITTQE NNAVLFE-<I 
! ; F>/.vnCKLOTSF PLSPSKFAKY GMDSVEEQNf V^DYCD^^I 

: : yllff^^^ DrANKTi^NKI. FISCGYKYMQ CAKEYKDHFK Kin.AMSSKitn 
• V, hr^^^^"" TR.AFFKYLVC N'IKDTIFGEE HYPDFFLSTL .^iEI-S^^< 
'.0! -r^YTFh-RVCMl L-cLAKEAKLKS DOCQSLiQYD A ^ '"^ith STTn 
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31 acccaacaacaacaccaagccaaaccccaacacgaag-gcgcracca^cgaccgacaacaccticcacczcaccggccgrra 160 

161 cczaagcacaaggacaaaaagaacaacccccczccccccaaagacccttiaccrzaccaacticacccrzcaaacacacczcg 240 

241 ggct:cgccracctt:raaccgt:ggcaccgi:c::cagccgccacct:ccagccaaccgcgcgt:rrc^acc:ccgccaccggacac 320 

3 21 agcccccggagt:agcccacagaaaccc!:t:acaaa.ccncccgar:gagact:acaccagactcai:cacagcccgt:gcacact:c 400 

401 tiraacat:ggagcccLacact::ragacgagccac<?ccgcat:gacggagcact:tggcaccat:ccaacgccrgcccr:gaaaag 4 80 

481 g-rgacaaccanccgcaaaaccacgcccuragcggcggcaacccgcgaaagcczcccgacgccrgcacacgtcragcacg 550 

5 61 acn.gagacact:caaaaaccT:ccacccaccacaaccccrrra.acgcggct:cracccccci:acccrci:ac:cczcacgctgct: 640 

641 ccaaacacgcaticaricccgcacnaggccccccrcc^gccrcacncctzggaaccgcaccnnctccacnaccccccccaacga 720 

721 acaacccaaactagccccgcttacaaccgacagcagcagaaagaccggcgacnccacccgcgcaacgccaccagccnaaa 800 

901 gacacccrgcaaaacaticraticagccaccaccacacaaaaaaaaticctiatiaacnacaaacaccaaccaacaccrgcggcc 880 

381 accacrcaccraaaacgctracgaccagcaggacaccrtigcacacacatiagcracgczcaacggtiracccgcaacccgc 958 

959 ATG ACC GAA CAC CAT ACC CCC AAA AGC AGG ATT CTT CGC TTT CTA GAG AAT CAA TAT GTA 1018 

1 M T £ H H 7 ? :< S R I R F L E Q r V 20 

1019 TAG CTA TGT ACC TTA AAT GAT TAT GTA CAA CTT GTT TTG AGA GGG TCG CCG GCA AGC TCG 1078 
^l-^uCTLMDrVQL\/LRGS?ASS 40 

10 7? TAT AGC AAT ATA TGC GAA CGC TTC AGA AGC GAT GTA CAA ACG TCC TTT TCT ATT TTT CTT 113 3 
-^^''SUZCZRLRSDVQTSrSZru 60 

113 9 CAT TCG ACT GTA CTT GGC TTC GAC AGT AAG CCA GAT GAA GGT GTT CAA TTT TCT TCT CCA 1198 
' 51 H S T V V G r D S :< ? □ S G V Q F S S ? 30 
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'""31 iJ^"'' r~ 7'^ -"^""^^■^=----=?.= ---- = 5ac-. = = ::-.z = -.ac-.c;,5gacagc-.a^c^=^::555eag ,272 

86 

. . ^ ^^'^ ^*^^ ■-^'^ Gr^^ ^.GT CGA AGG AAT ' — --ii-* 

^ ^ * ^ - S z. R H L 

1 T 



07 r r ^--5cac ===.ac=^=gaa...cc=ac«gcaacn.cc,= ..=,,,^,^, ,,,5 

113 

:406 = = ^=a. = -.aacccaca.ag AAT CAT GAA GAT -TTT CGA GCC ATG CAT GTA AAC GCA GTA CAA AAT M69 

^' H H D r R A M H V N G V o 'jg 

^^170 GAT CTC GTT TCT ACT TTT CCT AAT '^^r --•^ • t« / « * 

123 D u V s T F = T r 4 f"* "== ^S29 

'"9 ^"""==^^==^-9--^^--"'=^=^--^=^agacc^acaagcaca-<j t ATC GGC :s01 



155 



L0O2 AGT GAT GCC ATC CAT TAC T^A T~A -t-r -'a 1 •/-— • - 

156 S D A M w r ; 7 ; '^^^ ISfil 

CAG ATT TCT CGC ATA CCA CTT TTT .AAA AAT AAT 



^ 5 r F £ A L ? M 0 175 



. r . 0 V- r r p r * k r r r r r 
fls 7^ f" r° t"""" r """^ ''^^ -^^ ^ 

;:'f^78 2 GAA GTT TCC TGG AAT i.rir* * t-t* t'^* . » — _ . 

: E V s w r c ; I^" f r^^ '-"^ -^^ ^^^^ 



1^342 AAG AAG TTT AAG CAA G 5Caac=aacacc5C=a.c==r=acaaccaaccr=ag AT CTA TAT TTT AAC 

^-i V u L r F N 245 

^:1903 TTA CAC TCT ATT TCT GAT CCG AAC ACA CTA CAC ATC TCG CTT CAA TGG ATT TTT CCA AGG 1967 

^ ^ ^ ° « «^ ~ V H M W L Q w : p ? a 265 

f ^ f ^ '^^ =^ =^ AAG CAA TTG CAC AAA CTC ATT CCA CTC GTA 2027 

'2a! T"" f '"^ 7" S""" '''''' ^ "^^^ "^-^ CAA ACA 2087 

'^Se f T° u^'' ""^ ■"^■^ ^ ^"^ "^^^ ^^-^ -^^-^ -C CCA TAT ATT 2147 

'^26 o^"" f ^ ^ ^""^ ^^'^ '-"^ AAC CAG CTG TTT GCC 2207 

^' * f ^ I""- ^"^ ^ -^-^C — ^' AAC C^A AGG ATA 2257 



-^Sd •««->* *f»«w « i. . « 

■ * - J i-kV^ rttrt rtiA 

I : : L >: D 



G ^c^tcczacaaeacrtic-accacc^iaccicczraccag AC CTC CAA ACT 2336 

L £ -7 375 
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23 37 ™C TTC AAA TTA TCG AG A TAG GAG TGT TTT AGT TTA CAT TAT TTA ATG AGT AAC ATA AAG 23 9 5 

j75 r - :< L S R Y E S r S L H '{ L H S ^ Z K 395 

2397 cc.aacacgccaaact:trcrr2CcacC2acC2.a.caaccag ATT TCA GAA ATT GAA TGG CTA GTC CTT GGA 2 465 

ISS I S £ r £ W L 7 L G 405 

2466 AAA AGG TCA AAT GCG AAA ATG TGC TTA AGT GAT TTT GAG AAA CGC AAG CAA ATA TTT GCG 2525 

* 406 :< ?, S N A :< M C L S D F £ :< R K Q : r A. 425 

25 2 5 GAA TTC ATC TAC TGG CTA TAG AAT TCG TTT ATA ATA CCT ATT TTA CAA TCT TTT TTT TAT 2 58 5 

426 ZFir^LrtiSFl IP Z L Q S F F r 445 

3'5 3 5 ATC ACT GAA TCA AGT GAT TTA CGA AAT CGA ACT GTT TAT TTT AGA AAA GAT ATT TGG AAA 2 64 5 

4 46 :T£SS DLR^^RT,vr F R k a l w K 465 

2 6 46 CTC TTG TGC CGA CCC TTT ATT ACA TCA ATG AAA ATG GAA GCG TTT GAA AAA -ATA AAC GAG 2705 

-60 L - C R ? r r T 5 M K M £ A F Z K I N £ 4 85 

27 05 gcaccrraaagcaccccncqcaaaaagccaacac::- rcag AAC AAT GTT AGG ATG GAT ACT GAG AAA ACT 2775 
•436 MNVRMDTQKT495 

1775 ACT TTG CCT CCA GCA GTT ATT CGT CTA TTA CCT .AAG AAG AAT ACC TTT CGT CTC ATT ACG 2 33 5 

495T L ? ? A V I R L L ? K K N T r R L r T 515. 

23 3 6 AAT TTA AGA AAA AGA TTC TTA ATA AAG gcaccaaccccct^gccaccaacgcaccccactizczaacccactia 2906 

' 5 16 M L R :< R r L I K 524 

29 07 -cagcag ATC GGT TCA AAC .AAA AAA ATC TTA GTC AGT ACC AAC CAA ACT TTA CGA CCT GTC 2967 
525 M G S N :< :< M , L V S T N Q T L R P V 542 

2 9'6a GCA TCG ATA CTC AAA CAT TTA ATC AAT GAA GAA AGT AGT GGT ATT CCA TTT AAC TTC GAG 3 027 
543A S I L K H L I NcT £ SSG I ? FN LE 5 62 

3028 GTT TAC ATG AAG CTT CTT ACT TTT AAG AAG GAT CTT CTT AAG CAC CGA ATG TTT GG gcaac 3 088 

5 63 V r H K L L T F K K 0 L L K H R « F G 581 

30 8 9 cacacaacgcgcgaccccrcaccaccaaccccgcag G CGT AAG AAG TAT TTT GTA CGG ATA GAT ATA 3 155 

5 3 2 R :< :< r r V R : o r 5 91 

3 156 AAA TCC TCT TAT GAT CCA ATA AAG CAA GAT TTC ATG TTT CGG ATT GTT AAA AAG AAA CTC 3 215 

592 :< s c r D a r k q o l m f r z v k k k'L Sll 

3216 AAG GAT CCC GAA TTT GTA ATT CGA AAG TAT GCA ACC ATA CAT GCA ACA AGT GAC CCA GCT 3 275 

6 12 :< O P £ F V r R K r a t i h a t*s d r a S31 

3276 ACA AAA AAC TTT GTT AGT GAG GCG TTT TCC TAT T gcaagccracccncccaccggaacuticccaacaa 3 3 43 

532T:<NrVS£AFSYF 643 

33 44 aczcrrrrtcag TT GAT ATG GTG CCT TTT GAA AAA GTC GTC GAG TTA CTT TCT ATG AAA ACA 3 4 05 

5 44 D M V ? F £ :< V V Q L L S M K T 659 

3 40 6 rCA GAT ACT TTC TTT GTT GAT TTT GTG GAT TAT TCG ACC AAA AGT TCT TCT GAA ATT TTT 3 4 65 

5 50 2 D T - - V D r V D ? w T >: s 5 5 h: : ^"^^ 

3 4 66 AAA ATG CTC AAG GAA CAT CTC TCT GGA CAC ATT GTT AAG gcacaccaac cgc tgaaccgcadcaaca 3 5 32 



5 30 



592 
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35 3 3 czaacca;iact:ag ATA GGA AJ^T TCT CAA TAG CTT CAA AAA GTT GOT ATC CCT CAG GGC TCA 3 59 3 

6 93 r G M S Q r ^ L Q :< V G r ? Q G S 7 08 

3 5 94 ATT CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG GAA GAT TTG ATT GAT GAA TAC CTA TCC 3 653 

103 I S S r L C H c '{ H Z 0 LIDSYLS 728 

3 6 54 TTT ACG AAA AAG AAA GGA TCA GTG TTG TTA CGA GTA GTC CAC GAT TTC CTC TTT ATA ACA 3 713 

.7 29FT:<:<KGSVLLRV'/ DDFLFIx 748 

3 7 14 GTT AAT AAA AAG GAT GCA AAA AAA TTT TTG AAT TTA TCT TTA AGA G gcgagcL^ccgccacccc 3777 

749 V M :< :< 0 ;^ :< .< F L N L S L R G 7 64 

3773 caaqccccaaccgccgaag GA TTT GAG AAA CAC AAT TTT TCT ACG AGC CTG GAG AAA ACA GTA 3 840 

7 55 F £ :< H M F S T S L £ K T V 778 

30 41 ATA AAC TTT GAA AAT AGT AAT GGG ATA ATA AAG AAT ACT TTT TTT AAT GAA AGC AAG AAA 3 900 

77gr.-JFE:.V3MG I I NMT FFNESKK 7 98 

3 901 AGA ATG CCA TTC TTC GGT TTC TCT GTG AAC ATG AGG TCT CTT GAT ACA TTG TTA GCA TGT 3 960 
7 99 ?. M ? F F G F S ^/ N M R S L D T L LAC 818 

J 96 I -C« '-'AA ATT GAT CAA GCC TTA TTT .AAC TCT A.CA TCT GTA GAG CTC ACG AAA CAT ATG GGG 4 020 

315 ? :< IDEAL F U S r S V £ L T ;< H M G 838 

4021 .AAA TCT TTT TTT TAC .\AA • ATT CTA AG qca caccgcgcaaccgaacaacagccgacaaacaa cca^g A TCG 4 089 

33 9 K 3 F F Y :< r L R S 84*8 

4090 AGC CTT GCA TCC TTT GCA CAA GTA TTT ATT GAC ATT ACC CAC AAT TCA AAA TTC AAT TCT 4149 

949S L A S FAQ V F I D I T H S S K FN S 868 

4 1Sja TCC TGC AAT ATA TAT AGG CTA GGA TAC TCT ATC TCT ATC AGA GCA CAA GCA TAC TTA AAA 4 209 
969 C C N : Y =1 L C r S M C M R A Q A Y L K 888 

421C AGG ATG AAG GAT ATA TTT ATT CCC CAA AGA ATG TTC ATA ACG G gcgagcaccraccczaactaga 4274 

38 9 ?. M :< □ : r I P Q R M F I TO 903 

4275 aaacccaccaaccaaccccag AT CTT TTG AAT GTT ATT GGA AGA AAA ATT TGC AAA AAG TTG GCC 4 339 
90 4 LLNVrGRKrWK:<LA917 

4 34 0 GAA ATA TTA JGGA TAT ACC AGT AGG CCT TTC TTG TCC TCT GCA GAA GTC AAA TC gcacgcgcc 4401 

918£:c. CrTSRRFLSSAEViCW 93 5 

4402 ggcc::cgagaccccagcaacacLgacacatcag G CTT TTT TGT CTT GGA ATG AGA GAT CCT TTC AAA 4468 

936 LFC LGrtROGLK 946 

4 46 9 CCC TCT TTC AAA TAT CAT CCA TGC TTC GAA CAG CTA ATA TAC CAA TTT CAG TCA TTC ACT 4 528 

947?SFfCYHPCF£QLr rQFQ^LT 966 

4 529 GAT CTT ATC AAG CCC CTA AGA CCA GTT TTG CGA CAG GTG TTA TTT TTA CAT AGA AGA ATA 4 588 

967 Q L : K ? L R P 7 L R 0 '/ L F L H ?. R ^ 

4589 GCT GAT TAA cgccatccccaacc caccacacacaccc- rcaccaccggccccncaaacaacaccaccaccaagcaca 4665 

987 A D ' 989 
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46 6 6 qc-^acccccaaagcaagcacaccacaggaccr = r.agcaaagcaaaact:aacc-z?ccaccagccrr:?accga.ccc?ccc 4 74 5 

474 5 czacccccacaccccraagaaagaccgacagcggccgccgaccaccgcccacacgcccaccaaacgggagtiggccaaaca 4 82S 

-;8 2 6 ccaaaagcaacacacgaggccaaccccczti^icaccragaacaaggaaagcggcticcccacaacgaacaacgcccgcacta 4 905 

49 0 6 acgcaaaaagacgaagactacccccraaacaagggggactiaagcatacccgaaggaaaagagagcaa cacacccagcgcc 4 9 8 5 

49 86 Gccgaagaaagcaaggacaacccggaacaagcrcczgcagacgacaggccaaacczcggcgaccgaacctirggcaaaagc 50S5 

50 66 cccaggccat:ccat:ggcggccggcct:cgccac::gagac:gaaaaga.aaccaaggacagcct:gaacaccaacagcz=:ac:cra 5145 
514 6 acgcccLacacaaggcLtirgcrrrLCcccgaccccaaccrsgcacgggcgaaaagaaacagcgnnaagccaccactggac 5 225 
522 5 cccgaaacagccaaaccccccggccczccaaagcggaagccraaagaacctaccgaagcccacgaggcLCcaaaaacccc 5 3 05 

52 06 cccrgant:raaaggaggaacccrccaccgacgaggaaacggacagcccaccagcrcccgaggagaagccraaccz::wi:gc 5 3 85 

53 8 6 aaaaaagaaaacaLcaccgggagacacczcnrgacgaaccagacgcggagagcaccrccagcggacccTirgacgccaaca 5 4 65 

54 6 6 actrcracrrccgaaacgcacggcccraccgccgcrccgaccrcccgcagccccacgcagccaagcgaccaaaggcacc 5 5 44 



FIGURE 16 

^page 1 
(Seq. ID. No. 1) 

1 gcagcgctgc gtcctgctgc gcacgtggga agccctggcc ccggccaccc ccgcgatgcc 
61 gcgcgctccc cgctgccgag ccgtgcgccc cctgctgcgc agccactacc gcgaggtgct 
121 gccgctggcc acgttcgtgc ggcgcctggg gccccagggc tggcggctgg tgcagcgcgg 
181 ggacccggcg gctttccgcg cgctggtggc ccagcgcctg gtgtgcgtgc cctgggacgc 
241 acggccgccc cccgccgccc cccccttccg ccaggcgtcc tgcctgaagg agctggtggc 
3 01 ccgagtgctg cagaggctgt: gcgagcgcgg cgcgaagaac gtgctggcct: tcggcttcgc 
3 61 gctgctggac ggggcccgcg ggggcccccc cgaggccttc accaccagcg tgcgcagcta 
421 cctgcccaac acggtgaccg acgcactgcg ggggagcggg gcgtgggggc tgctgctgcg 
481 ccgcgtgggc gacgacgcgc tggttcacct gctggcacgc tgcgcgctct ttgtgctggt 
541 ggctcccagc tgcgcctacc aggtgtgcgg gccgccgctg taccagctcg gcgctgccac 
601 tcaggcccgg cccccgccac acgctagtgg accccgaagg cgtctgggat: gcgaacgggc 
661 ctggaaccat agcgtcaggg aggccggggc ccccctgggc ctgccagccc cgggtgcgag 
721 gaggcgcggg ggcagtgcca gccgaagtct gccgttgccc aagaggccca ggcgtggcgc 
7 81 tgcccctgag ccggagcgga cgcccgttgg gcaggggtcc tgggcccacc cgggcaggac 
841 gcgtggaccg agtgaccgtg gtzttctgtgt: ggcgtcacct gccagacccg ccgaagaagc 
901 cacctctctg gagggtgcgc tctctggcac gcgccactcc cacccatccg tgggccgcca 
961 gcaccacgcg ggccccccac ccacaccgcg gccaccacgt ccctgggaca cgccttgtcc 
1021 cccggtgtac gccgagacca agcacctccc ctacccctca ggcgacaagg agcagctgcg 
10 81 gccctccctc ctactcagct ctctgaggcc cagcccgact: ggcgctcgga ggctcgtgga 
1141 gaccatcntt ctgggctcca ggccctggat gccagggact ccccgcaggt: tgccccgcct 
1201 gccccagcgc tactggcaaa tgcggcccct gtttctggag ctgcttiggga accacgcgca 
1261 gtgcccctac ggggtgctcc ccaagacgca ctgcccgctg cgagctgcgg tcaccccagc 
1321 agccggtgtc tgtgcccggg agaagcccca gggctctgtig gcggcccccg aggaggagga 
13 81 cacagacccc cgtcgcctgg tgcagctgct ccgccagcac agcagcccct: ggcaggtgta 
1441 cggcttcgtg cgggccrgcc tgcgccggct: ggtgccccca ggcctctggg gctccaggca 
1501 caacgaacgc cgcttcccca ggaacaccaa gaagttcatc tccctgggga agcatgccaa 
1561 gccctcgctg caggagctga cgtggaagat: gagcgtgcgg gactgcgctt ggctgcgcag 
1621 gagcccaggg gttggccgtg ttccggccgc agagcaccgr ctgcgtgagg agatcctiggc 
1681 caagttcctg cactggccga tgagtgtgca cgncgtcgag ctgctcaggt: ctttctttta 
1741 tgtcacggag accacgcttc aaaagaacag gctctttttc taccggaaga gtigtctggag 
18 01 caagctgcaa agcattggaa tcagacagca cttgaagagg gtgcagctgc gggagccgtc 
18 61 ggaagcagag gtcaggcagc atcgggaagc caggcccgcc ccgctgacgt ccagactccg 
1921 cttcatcccc aagccrgacg ggccgcggcc gattgtgaac atggactacg tcgcgggagc 
1981 cagaacgttc cgcagagaaa agagggccga gcgtctcacc tcgagggtga aggcactigtit: 
2041 cagcgtgctc aactacgagc gggcgcggcg ccccggcctc ctgggcgcct ctgtgctggg 
2101 cctggacgat atccacaggg cctggcgcac cttcgtgctg cgtgtgcggg cccaggaccc 
2161 gccgcctgag ctgtaccttg tcaaggtgga tgtgacgggc gcgtacgaca ccatccccca 
2221 ggacaggctc acggaggtca tcgccagcat catcaaaccc cagaacacgt actgcgtgcg 
2281 tcggtatgcc gtggtccaga aggccgccca tgggcacgtc cgcaaggcct tcaagagcca 
2341 cgtctctacc ttgacagacc tccagccgta catgcgacag ttcgtggctc acctgcagga 
2401 gaccagcccg ctgagggatg ccgtcgtcat cgagcagagc tcctccctiga atgaggccag 
2461 cagtggcctc ttcgacgtct tcctacgctt: catgtgccac cacgccgtgc gcatcagggg 
2521 caagtcctac gtccagtgcc aggggatccc gcagggctcc atcctctcca cgctgctctg 
2581 cagcctrgtgc tacggcgaca tggagaacaa gctgtttgcg gggattcggc gggacgggct 
2641 gctcctgcgt ttggtggatg atttcttgtt: ggcgacacct cacctcaccc acgcgaaaac 
2701 cctcctcagg accctggccc gaggtgtccc tgagtatggc tgcgtggtga acttgcggaa 
2761 gacagtggtg aacttcccrg tagaagacga ggccctgggt: ggcacggctt ttgttcagat 
2821 gccggcccac ggcctattcc cctggtgcgg cccgctgctg gatacccgga ccctggaggt: 
2881 gcagagcgac tactccagct: atgcccggac ccccaccaga gccagtctca ccttcaaccg 



2941 cggcttcaag gctgggagga 
3001 tcacagccrg tttctggacc 
3061 caagancctc ctgctgcagg 
3121 tcagcaagtt tggaagaacc 
3181 ctgccacrcc atcctgaaag 
3241 cggccccctg ccctccgagg 
33 01 gactcgacac cgtgtcacct: 

33 61 gctgagtcgg aagctcccgg 
3421 actgcccnca gacttcaaga 

34 81 gagcagacac cagcagccct 
3541 cacacccagg cccgcaccgc 
3 601 catgtccggc tgaaggctga 
3661 gagtgtccag cacacctgcc 
3721 gggccagcct ttcctcacca 
3781 ccagactcgc cattgttcac 
3 8 41 aggtggagac cctgagaagg 
3 901 ccctgtacac aggcgaggac 
3 961 gaggcgcngc gggagtaaaa 



FIGURE 16" 
page 2 
(Seq. ID. No. 1) 

acacgcgtcg caaactcttt 
tgcaggtgaa cagcctccag 
cgtacaggtt tcacgcatgt 
ccacattttt cctgcgcgtc 
ccaagaacgc agggatgtcg 
ccgtgcagtzg gctgtgccac 
acgtgccact cctggggtca 
ggacgacgct gactgccctg 
ccatcctgga ctgatggcca 
gtcacgccgg gctctacgtc 
tgggagcctg aggcctgagt: 
gtgtccggct gaggcctgag 
gtcttcactt ccccacaggc 
ggagcccggc ttccaccccc 
ccctcgccct: gccccccttt 
accctgggag ctctgggaat 
cctigcacccg gatgggggcc 
tactgaacat atgagttttit: 



ggggccttgc ggccgaagtg 
acggtgtgca ccaacatcta 
gtgctgcagc tcccatttca 
atctctgaca cggccccccc 
ctgggggcca agggcgccgc 
caagcattcc tgctcaagct 
ctcaggacag cccagacgca 
gaggccgcag ccaacccggc 
cccgcccaca gccaggccga 
ccagggaggg aggggcggcc 
gagtgtttigg ccgaggcccg 
cgagcgtcca gccaagggct 
-ggcgctcgg ctccacccca 
cacataggaa tagtccatcc 
gccttccacc cccaccatcc 
ttggagtgac caaaggtgtg 
cccgcgggrc aaactggggg 
cagttttgaa aaaaa 



FIGURE 17 
HUMAN TRT PROTEIN SEQUENCE 
(SEQ. NO. 2) 

MPRAPRCRAVRSL.LRSHYREVLPLATFVRRLGPQGWRLVQRGDP 

AAFRALVAQCLVCVPWDARP P PAAP S FRQVS CLKELVARVLQRLCERGAKNVLAFGFA 

LLDGARGGPPEAFTTSVRSYLPNTVTDAI.RGSGAWGLLLRRVGDDVLVHLLARCALFV 

LVAPSCAYQVCGPPLYQLGAATQARPPPHASGPRRRIiGCERAWNHSVREAGVPLGLPA 

PGARRRGGSASRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCWSPA 

RPAEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYS 

SGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPL 

FLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQ 

LLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQEL 

TWKMSWDCAWLRRSPGVGCVPAAEKRLREEILAKFLHWIiMSVYVVELLRSFFYVTET 

TFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFI 

PKPDGLRPIVNMDYWGARTFRREKRAERLTSRVKAIuFSVLNYERARRPGLLGASVLG 

LDDIHRAWRTFVLRVRAQDPPPELYFVKVDVTGAYDTIPQDRLTEVIASIIKPQNTYC 

VRRYAWQKAAHGHVRKAFKSHVSTL.TDLQPYMRQFVAHLQETSPLRDAWIEQSSSL 

NEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAG 

IRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCWNLRKTWNFPVEDEAL 

GGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRR 

KLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPT 

FFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVT 

YVPLLGSLRTAQTQLSRKLPGTTL7ALEAAANPALPSDFKTILD 



FIGURE 18 

Clone 712562 
(SEQ ID NO. 3 ) 

GGCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTT 

TTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTG 

GAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCT 

GTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACT 

CCGCTTCATCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGG 

AGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACT 

GTTCAGCGTGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCT 

GGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGA 

CCCGCCGCCTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCC 

CCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGT 

GCGTCGGTATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAG 

CCACGTCCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCT 

GCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGC 

TGCTCCTGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAA 

CCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGA 

AGACAGTGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGA 

TGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGG 

TGCAGAGCGACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCZAGTCTCACCTTCAACC 

GCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGT 

GTGACAGCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCT 

ACAAGATCCTCCTGCTGCAGGCGTACAGG7TTCACGCATGTGTGCTGC:AGCTCCCATTTC 

atcagcaagtttggaagaaccccacatttttcctgcgggtcatctctgacacggcctccc 
tctgctactccatcctgaaagccaagaacgcagggatgtcgctgggggccaagggcgccg 

CCGGCC7TCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTCAAGC 

tgactcgacaccgtgtcacctacgtgccactcctggggtcactcaggacagcccagacgc 
agctgagtcggaagctcccggggacgacgctgactgccctggaggccgcagccaacccgg 
cactgccctcagacttcaagaccatcctggactgatggccacccgcccacagccaggccg 
agagcagacaccagcagccctgtcacgccgggctctacgtcccagggagggaggggcggc 
ccacacccaggcctgcaccgctgggagtctgaggcctgagtgagtgtttggccgaggcgt 

GCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGC 

tgagtgtccagcacacctgccgtcttcacttccccacaggctggcgctcggctccacccc 

AGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATC 

cccagattcgccattgttcacccctcgccctgccctcctttgccttccacccccaccatc 

CAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGT 

gccctgtacacaggcgaggaccctgcacctggatgggggtccctgtgggtcaaattgggg 
ggaggtgctgtgggagtaaaatactgaatatatgagtttttcagttttgoaaaaaaaaaa 

AAAAAAAAAAAAAAAA 
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SEQ ID NO. 10 
MetSerValTyrValValGluIieuLeuArgSerPhePhe 

TyrValThrGluThrThrPheGlnLysAsnArgJLeuPhePheTyrArgLysSerValTrp 

SerLysLeuGlnSerlleGiylleArgGlnHisLeuLysArgValGlnLeiiArgGluLeu 

SerGluAlaGluValArgGlnHisArgGluAlaArgProAlaLeuLeuThrSerArgLeu 

ArgPhelleProLysProAspGlyLeuArgProIleValAsnMetAspTyrValValGly 

AlaArgThrPheArgArgGluLysArgAlaGluArgLeuThrSerArgValLysAlaLeu 

PheSerValLeuAsnTyrGluArgAlaArgArgProGlyLeuLeuGlyAlaSerValLeu 

GlyLeuAspAspIleHisArgAlaTrpArgThrPheVaHjeuArrgValAr-gAlaGlnAsp 

ProProProGluLeuTyrPheValLysValAspValThrGlyAlaTyrAspThrllePro 

GlnAspArgLeuThrGluVallleAlaSerllelleLiysProGliiAsnThrTyrCysVal 

ArgArgTyrAlaValValGlnLysAlaAlaHisGlyHisValArgLysAlaPheLysSer 

HisValLeuArgProValProGlyAspProAlaGlyLeuHisPiroLeuHisAlaAlaLeu 

GlnProValLeuArgArgHisGlyGluGlnAlaValCysGlyAspSerAlaGlyArgAla 
AlaProAlaPheGlyGly 
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SEQUENCE NO. 4 (DNA) AND SEQUENCE NO. 5 (PROTEIN) 
(TRANSLATION OF A Al 82 hTRT VARIANT) 



1 

met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 
his cyr arg glu val leu pro leu ala chr phe val arg arg leu 
CAC TAG CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

gly pro gin gly trp arg leu val gin arg gly asp pro ala ala 
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

50 60 

phe arg ala leu val ala gin cys leu val cys val pro trp. asp 
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

ala arg pro pro pro ala ala pro ser phe arg gin val ser cys 
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

80 90 
leu lys glu leu val ala arg val leu gin arg leu cys glu arg 
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

120 

ala arg gly gly pro pro glu ala phe thr chr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 



tyr leu pro asn thr val thr 
TAG CTG CCC AAC ACG GTG ACG 

140 

trp gly leu leu leu arg arg 
TGG GGG CTG CTG CTG CGC CGC 



leu leu ala arg cys ala leu 
CTG CTG GCA CGC TGC GCG CTC 

170 

ala cyr gin val ■ cys gly pro 
GCC TAG CAG GTG TGC GGG CCG 



thr gin ala arg pro pro pro 
ACT CAG GCC CGG CCC CCG CCA 

200 

leu gly cys glu arg ala trp 
CTG GGA TGC GAA CGG GCC TGG 

val pro leu gly leu pro ala 
GTC CCC CTG GGC CTG CCA GCC 

230 

ser ala ser arg ser leu pro 
AGT GCC AGC CGA AGT CTG CCG 

ala ala pro glu pro glu arg 
GCT GCC CCT GAG CCG GAG CGG 

260 

ala his pro gly arg thr arg 
GCC CAC CCG GGC AGG ACG CGT 
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130 

asp ala leu arg gly ser gly ala 
GAC GCA CTG CGG GGG AGC GGG GCG 

150 

val gly asp asp val leu val his 
GTG GGC GAC GAC GTG CTG GTT CAC 

160 

phe val leu val ala pro ser cys 
TTT GTG CTG GTG GCT CCC AGC TGC 

180 

pro leu tyr gin leu gly ala ala 
CCG CTG TAG CAG CTC GGC GCT GCC 

190 

his ala ser gly pro arg arg arg 
CAC GCT AGT GGA CCC CGA AGG CGT 

210 

asn his ser val arg glu ala gly 
AAC GAT AGC GTC AGG GAG GCC GGG 

220 

pro gly ala arg arg arg gly gly 
CCG GGT GCG AGG AGG CGC GGG GGC 

240 

leu pro lys arg pro arg arg gly 
TTG CCC AAG AGG CCC AGG CGT GGC 

250 

thr pro val gly gin gly ser trp 
ACG CCC GTT GGG CAG GGG TCC TGG 

270 

gly pro ser asp arg gly phe cys 
GGA CCG AGT GAC CGT GGT TTC TGT 



val val ser pro ala arg pro 
GTG GTG TCA CCT GCC AGA CCC 

290 

gly ala leu ser gly thr arg 
GGT GCG CTC TCT GGC ACG CGC 

gin his his ala gly pro pro 
GAG CAC GAG GCG GGC CCC CCA 

320 

t:rp asp ■ chr pro cys pro pro 
TOG GAG ACG CCT TGT CCC GCG 

leu tyr ser ser gly asp lys 
CTC TAG TCC TCA GGC GAG AAG 

350 

leu ser ser leu arg pro ser 
CTC AGC TCT CTG AGG CCC AGC 



glu thr lie phe leu gly ser 
GAG ACC ATC TTT CTG GGT TCC 

380 

arg arg leu pro arg leu pro 
CGC AGG TTG CCC CGC CTG CCC 

leu phe leu glu leu leu gly 
CTG TTT CTG GAG CTG CTT GGG 

410 

val leu leu lys thr his cys 
GTG CTC CTC AAG ACG CAC TGC 
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280 

ala glu glu ala thr ser leu glu 
GCC GAA GAA GCC ACC TCT TTG GAG 

300 

his ser his pro ser val gly arg 
CAC TCC CAC CCA TCC GTG GGC CGC 

310 

ser thr ser arg pro pro arg pro 
TCC ACA TCG CGG CCA CCA CGT CCC 

330 

val tyrala glu thr lys his phe 
GTG TAG GCC GAG ACC AAG CAC TTG 

340 

glu gin leu arg pro ser phe leu 
GAG CAG CTG CGG CCC TCC TTG CTA 

360 

leu thr gly ala arg arg leu val 
CTG ACT GGC GCT CGG AGG CTC GTG 

370 

arg pro trp met: pro gly thr pro 
AGG CCC TGG ATG CCA GGG ACT CCC 

3^90 

gin arg tyr trp gin met arg pro 
CAG CGC TAG TGG CAA ATG CGG CCC 

400 

asn his ala gin cys pro tyr gly 
AAC CAC GCG CAG TGC CCC TAG GGG 

420 

pro leu arg ala ala val thr pro 
CCG CTG CGA GCT GCG GTC ACC CCA 



430 
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ala ala gly 
GCA GCC GGT 



ala pro glu 
GCC CCC GAG 



leu arg gin 
CTC CGC CAG 



ala cys leu 
GCC TGC CTG 



his asn glu 
CAC AAC GAA 



leu gly lys 
CTG GGG AAG 



met: ser val 
ATG AGC GTG 



gly cys val 
GGC TGT GTT 



ala lys phe 
GCC AAG TTC 



leu ar-g ser 
CTC AGG.TCT 



arg leu phe 
AGG CTC TTT 



val cys ala 
GTC TGT GCC 

440 

glu glu asp 
GAG GAG GAC 



his ser ser 
CAC AGC AGC 

470 

arg arg leu 
CGC CGG CTG 

arg arg phe 
CGC CGC TTC 

500 

his ala lys 
CAT GCC AAG 



arg asp cys 
CGG GAC TGC 

530 

pro ala ala 
CCG GCC GCA 

leu his trp 
CTG CAC TGG 

560 

phe phe tyr 
TTC TTT TAT 



phe tyr arg 
TTC TAC CGG 



arg glu lys 
CGG GAG AAG 



thr asp pro 
ACA GAC CCC 

460 

pro trp gin 
CCC TGG CAG 



val pro pro 
GTG CCC CCA 

490 

leu arg asn 
CTC AGG AAC 

leu ser leu 
CTC TCG CTG 

520 

ala trp leu 
GCT TGG CTG 



glu his arg 
GAG CAC CGT 

550 

leu met ser 
CTG ATG AGT 



val thr glu 
GTC ACQ GAG 

580 

lys ser val 
AAG AGT GTC 



pro gin gly 
CCC CAG GGC 



arg arg leu 
CGT CGC CTG 



val tyr gly 
GTG TAC GGC 



gly leu trp 
GGC CTC TGG 



thr lys lys 
ACC AAG AAG 



gin glu leu 
CAG GAG CTG 



arg arg ser 
CGC AGG AGC 



leu arg glu 
CTG CGT GAG 



val tyr val 
GTG TAC GTC 



thr thr phe 
ACC ACG TTT 



trp ser lys 
TGG AGC AAG 



ser val ala 
TCT GTG GCG 

450 

val gin leu 
GTG CAG CTG 



phe val arg 
TTC GTG CGG 

480 

gly ser arg 
GGC TCC AGG 

phe ile ser 
TTC ATC TCC 

510 
thr trp lys 
ACG TGG AAG 



pro gly val 
CCA GGG GTT 

540 

glu ile leu 
GAG ATC CTG 



val glu leu 
GTC GAG CTG 

570 

gin lys asn 
CAA AAG AAC 



leu gin ser 
TTG CAA AGC 



s 
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590 600 
ile gly ile arg gin his leu lys arg val gin leu arg glu leu 
ATT GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG 

610 

ser giu aia glu val arg gin his arg glu ala arg pro ala leu 
TCG GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG 

620 63Q 
leu thr ser arg leu arg phe ile pro lys pro asp gly leu arg 
CTG ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG 

640 

pro ile val asn ■ met asp tyr val val gly ala arg thr phe arg 
CCG ATT GTG AAC ATG GAC TAG GTC GTG GGA GCC AGA ACG TTC CGC 

650 650 
arg glu lys arg ala glu arg leu thr ser arg val lys ala leu 
AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG 

670 

phe ser val leu asn tyr glu arg ala arg arg pro gly leu leu 
:TC AGC GTG CTC AAC TAG GAG CGG GCG CGG CGC CCC GGC CTC CTG 

680 g9Q 
gly ala ser val leu gly leu asp asp ile his arg ala trp arg 
GGC GCC TCT GTG CTG GGG CTG GAC GAT ATC CAC AGG GCC TGG CGC 

700 

thr phe val leu arg val arg ala gin asp pro pro pro glu leu 
ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG 

710 720 
tyr phe val lys val asp val thr gly ala tyr asp thr ile pro 
TAG TTT GTC AAG GTG GAT GTG ACG GGC GCG TAG GAC ACC ATC CCC 



730 

gin asp arg leu thr glu val ile ala ser ile ile lys pro gin 
CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG 
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740 750 
asn thr tyr cys val arg arg tyr ala val val gin lys. ala ala 
AAC ACG TAG TGC GTG CGT CGG TAT GCC GTG GTC GAG AAG GCC GCC 

760 

his gly his val arg lys ala phe lys ser his val leu arg pro 
CAT GGG GAG GTC CGG AAG GCC TTC AAG AGC CAC GTC CTA CGT CCA 



770 



780 



val pro gly asp pro ala gly leu his pro leu his ala ala leu 
GTG CCA GGG GAT GCC GCA GGG CTC CAT CGT GTC CAC GCT GCT CTG 

790 

gin pro val leu arg arg his gly glu gin ala val cys gly asp 
CAG CGT GTG CTA CGG CGA CAT GGA GAA CAA GCT GTT TGC GGG GAT 

800 807 
ser ala gly arg ala ala pro ala phe gly gly OP 

TCG GCG GGA CGG GCT GCT CCT GCG TTT GGT GGA TGA TTTCTTGTTGGT 

GACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGA 

GTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCCTGTAGAAGACGAGGC 

CCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCT 

GCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAGCTATGCCCGGACCTC 

CATCAGAGCCAGTCTCACCTTCAACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAA 

ACTCTTTGGGGTCTTGCGGCTGAAGTGTCACAGCCTGTTTCTGGATTTGCAGGTGAACAG 

CCTCCAGACGGTGTGCACCAACATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCA 

CGCATGTGTGCTGCAGCTCCCATTTCATCAGCAAGTTTGGAAGAACCCCACATTTTTCCT 

GCGCGTCATCTCTGACACGGCCTCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGG 

GATGTCGCTGGGGGCCAAGGGCGCCGCCGGCCCTCTGCCCTCCGAGGCCGTGCAGTGGCT 



GTGCCACCAAGCATTCCTGCTCAAGCTGACTCGACACCGTGTCACCTACGTGCCACTCCT 
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GGGGTCACTCAGGACAGCCCAGACGCAGCTGAGTCGGAAGCTCCCGGGGACGACGCTGAC 

TGCCCTGGAGGCCGCAGCCAACCCGGCACTGCCCTCAGACTTCAAGACCATCCTGGACTG 

ATGGCCACCCGCCCACAGCCAGGCCGAGAGCAGACACCAGCAGCCCTGTCACGCCGGGCT 

CTACGTCCCAGGGAGGGAGGGGCGGCCCACACCCAGGCCCGCACCGCTGGGAGTCTGAGG 

CCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAG 

GCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCCGTCTTCACTTCCC 

CACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTC 

CACTCCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCCTCGCCCTGCC 

CTCCTTTGCCTTCCACCCCCACCATCCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTC 

TGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGAT 

GGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATG 

AGTTTTTCAGTTTTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 21- 
Genomic DNA insert of pGRNX44 

Seq. ID. No. 6 

1 CCATGGGACCCACTGCAGGGGCAGCTGGGAGGCTGCAGGCTTCAGGTCCCAGTGGGGTTG 
GGTACCCTGGGTGACGTCCCCGTCGACCCTCCGACGTCCGAAGTCCAGGGTCACCCCAAC 

6 1 CCATCTGCCAGTAGAAACCTGATGTAGAATCAGGGCGCGAGTGTGGACACTGTCCTGAAT 
GGTAGACGGTCATCTTTGGACTACATCTTAGTCCCGCGCTCACACCTGTGACAGGACTTA 

121 CTCAATGTCTCAGTGTGTGCTGAAACATGTAGAAATTAAAGTCCATCCCTCCTACTCTAC 
GAGTTACAGAGTCACACACGACTTTGTACATCTTTAATTTCAGGTAGGGAGGATGAGATG 

181 TGGGATTGAGCCCCTTCCCTATCCCCCCCCAGGGGCAGAGGAGTTCCTCTCACTCCTGTG 
ACCCTAACTCGGGGAAGGGATAGGGGGGGGTCCCCGTCTCCTCAAGGAGAGTGAGGACAC 

241 GAGGAAGGAATGATACTTTGTTATTTTTCACTGCTGGTACTGAATCCACTGTTTCATTTG 
CTCCTTCCTTACTATGAAACAATAAAAAGTGACGACCATGACTTAGGTGACAAAGTAAAC 



3 01 TTGGTTTGTTTGTTTTGTTTTGAGAGGCGGTTTCACTCTTGTTGCTCAGGCTGGAGGGAG 
AACCAAACAAACAAAACAAAACTCTCCGCCAAAGTGAGAACAACGAGTCCGACCTCCCTC 



361 TGCAATGGCGCGATCTTGGCTTACTGCAGCCTCTGCCTCCCAGGTTCAAGTGATTCTCCT 
ACGTTACCGCGCTAGAACCGAATGACGTCGGAGACGGAGGGTCCAAGTTCACTAAGAGGA 

alu 

4 2 X GCTTCCGCCTCCCATTTGGCTGGGATTACAGGCACCCGCCACCATGCCCAGCT7VATTTTT 
CGAAGGCGGAGGGTAAACCGACCCTAATGTCCGTGGGCGGTGGTACGGGTCGATTAAAAA 



481 TGTATTTTTAGTAGAGACGGGGGTGGGGGTGGGGTTCACCATGTTGGCCAGGCTGGTCTC 
ACATAAAAATCATCTCTGCCCCCACCCCCACCCCAAGTGGTACAACCGGTCCGACCAGAG 

CAP 



********************************************* 

541 GAACTTCTGACCTCAGATGATCCACCTGCCTCTGCCTCCTAAAGTGCTGGGATTACAGGT 
1 CTTGAAGACTGGAGTCTACTAGGTGGACGGAGACGGAGGATTTCACGACCCTAATGTCCA 



601 GTGAGCCACCATGCCCAGCTCAGAATTTACTCTGTTTAGAAACATCTGGGTCTGAGGTAG 
CACTCGGTGGTACGGGTCGAGTCTTAAATGAGACAAATCTTTGTAGACCCAGACTCCATC 
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CCAAT 



661 



CTTCGAGTGGGGTGAGTTCACAACACCACAAAATTCGGTTACTATCTTAAAAAAATAACA 

721 TGTTAGAACACTCTTGATGTTTTACACTGTGATGACTAAGACATCATCAGCTTTTCAAAG 
ACT^TCTTGTGAGAACTACAAAATGTGACACTACTGATTCTGTAGTAGTCGAAAAGTTTC 

CAP 

**4r************^ 

781 ACACACTAACTGCACCCATAATACTGGGGTGTCTTCTGGGTATCAGCGATCTTCATTGAA 
TGTGTGATTGACGTGGGTATTATGACCCCACAGAAGACCCATAGTCGCTAGAAGTAACTT 



CAP 

************ 

841 TGCCGGGAGGCGTTTCCTCGCCATGCACATGGTGTTAATTACTCCAGCATAATCTTCTGC 
ACGGCCCTCCGCAAAGGAGCGGTACGTGTACCACAATTAATGAGGTCGTATTAGAAGACG 



*** > 

901 TTCCATTTCTTCTCTTCCCTCTTTTAAAATTGTGTTTTCTATGTTGGCTTCTCTGCAGAG 
AAGGTAAAGAAGAGAAGGGAGAAAATTTTAACACAAAAGATACAACCGAAGAGACGTCTC 

CAP 

***************> 

961 AACCAGTGTAAGCTACAACTTAACTTTTGTTGGAACAAATTTTCCAAACCGCCCCTTTGC 
TTGGTCACATTCGATGTTGAATTGAAAACAACCTTGTTTAAAAGGTTTGGCGGGGAAACG 

1021 CCTAGTGGCAGAGACAATTCACAAACACAGCCCTTTAAAAAGGCTTAGGGATCACTAAGG 
GGATCACCGTCTCTGTTAAGTGTTTGTGTCGGGAAATTTTTCCGAATCCCTAGTGATTCC 

1081 GGATTTCTAGAAGAGCGACCCGTAATCCTTAAGTATTTACAAGACGAGGCTAACCTCCAG 
CCTAAAGATCTTCTCGCTGGGCATTAGGAATTCATAAATGTTCTGCTCCGATTGGAGGTC 

1141 CGAGCGTGACAGCCCAGGGAGGGTGCGAGGCCTGTTCAAATGCTAAGCTTCCATAAATAA 
GCTCGCACTGTCGGGTCCCTCCCACGCTCCGGACAAGTTTACGATTCGAAGGTATTTATT 

12 01 AGCAAATTTCCTCCGGCAGTTTCTGGAAAGTAGGAAAGGTTAACATTTAAGGTTGCGTTT 
TCGTTTAAAGGAGGCCGTCAAAGACCTTTCATCCTTTCCAATTGTAAATTCCAACGCAAA 

1261 GTTAGCATTTCAGTGTTTGCCGACCTCAGCTAACAGCATCCCTGCAAGGCCTCGGGAGAC 
CAATCGTAAAGTCACAAACGGCTGGAGTCGATTGTCGTAGGGACGTTCCGGAGCCCTCTG 

1321 CCAGAAGTTTCTCGCCCCTTAGATCCAAACTTGAGCAACCCGGAGTCTGGATTCCTGGGA 
GGTCTTCJ^AAGAGCGGGGAATCTAGGTTTGAACTCGTTGGGCCTCAGACCTAAGGACCCT 

TopoII 
******************> 

1381 AGTCCTCAGCTGTCCTGCGGTTGTGCCGGGGCCCCAGGTCTGGAGGGGACCAGTGGCCGT 
TCAGGAGTCGACAGGACGCCAACACGGCCCCGGGGTCCAGACCTCCCCTGGTCACCGGCA 

1441 GTGGCTTCTACTGCTGGGCTGGAAGTCGGGCCTCCTAGCTCTGCAGTCCGAGGCTTGGAG 
CACCGAAGATGACGACCCGACCTTCAGCCCGGAGGATCGAGACGTCAGGCTCCGAACCTC 
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1501 CCAGGTGCCTGGACCCCGAGGCTGCCCTCCACCCTGTGCGGGCGGGATGTGACCAGATGT 
GGTCCACGGACCTGGGGCTCCGACGGGAGGTGGGACACGCCCGCCCTACACTGGTCTACA 

1561 TGGCCTCATCTGCCAGACAGAGTGCCGGGGCCCAGGGTCAAGGCCGTTGTGGCTGGTGTG 
ACCGGAGTAGACGGTCTGTCTCACGGCCCCGGGTCCCAGTTCCGGCAACACCGACCACAC 

1621 AGGCGCCCGGTGCGCGGCCAGCAGGAGCGCCTGGCTCCATTTCCCACCCTTTCTCGACGG 
TCCGCGGGCCACGCGCCGGTCGTCCTCGCGGACCGAGGTAAAGGGTGGGAAAGAGCTGCC 

1681 GACCGCCCCGGTGGGTGATTAACAGATATTGGGGTGGTTTGCTCATGGTGGGGACCCCTT 
CTGGCGGGGCCACCCACTAATTGTCTATAACCCCACCAAACGAGTACCACCCCTGGGGAA 

1741 CGCCGCCTGAGAACCTGCAAAGAGAAATGACGGGCCTGTGTCAAGGAGCCCAAGTCGCGG 
GCGGCGGACTCTTGGACGTTTCTCTTTACTGCCCGGACACAGTTCCTCGGGTTCAGCGCC 

18 01 GGAAGTGTTGCAGGGAGGCACTCCGGGAGGTCCCGCGTGCCCGTCCAGGGAGCAATGCGT 
CCTTCACAACGTCCCTCCGTGAGGCCCTCCAGGGCGCACGGGCAGGTCCCTCGTTACGCA 

1861 CCTCGGGTTCGTCCCCAGCCGCGTCTACGCGCCTCCGTCCTCCCCTTCACGTCCGGCATT 
GGAGCCCAAGCAGGGGTCGGCGCAGATGCGCGGAGGCAGGAGGGGAAGTGCAGGCCGTAA 

1921 CGTGGTGCCCGGAGCCCGACGCCCCGCGTCCGGACCTGGAGGCAGCCCTGGGTCTCCGGA 
GCACCACGGGCCTCGGGCTGCGGGGCGCAGGCCTGGACCTCCGTCGGGACCCAGAGGCCT 

1981 TCAGGCCAGCGGCCAAAGGGTCGCCGCACGCACCTGTTCCCAGGGCCTCCACATCATGGC 
AGTCCGGTCGCCGGTTTCCCAGCGGCGTGCGTGGACAAGGGTCCCGGAGGTGTAGTACCG 

2041 CCCTCCCTCGGGTTACCCCACAGCCTAGGCCGATTCGACCTCTCTCCGCTGGGGCCCTCG 
GGGAGGGAGCCCAATGGGGTGTCGGATCCGGCTAAGCTGGAGAGAGGCGACCCCGGGAGC 

Spl 

2101 CTGGCGTCCCTGCACCCTGGGAGCGCGAGCGGCGCGCGGGCGGGGAAGCGCGGCCCAGAC 
GACCGCAGGGACGTGGGACCCTCGCGCTCGCCGCGCGCCCGCCCCTTCGCGCCGGGTCTG 

2161 CCCCGGGTCCGCCCGGAGCAGCTGCGCTGTCGGGGCCAGGCCGGGCTCCCAGTGGATTCG 
GGGGCCCAGGCGGGCCTCGTCGACGCGACAGCCCCGGTCCGGCCCGAGGGTCACCTAAGC 

2221 CGGGCAACAGACGCCCAGGACCGCGCTTCCCACGTGGCGGAGGGACTGGGGACCCGGGCA 
GCCCGTTGTCTGCGGGTCCTGGCGCGAAGGGTGCACCGCCTCCCTGACCCCTGGGCCCGT 

Spl 



E2F 

2281 CCGGTCCTGCCCCTTCACCTTCCAGCTCCGCCTCGTCCGCGCGGAACCCCGCCCCGTCCC 
GGCCAGGACGGGGAAGTGGAAGGTCGAGGCGGAGCAGGCGCGCCTTGGGGCGGGGCAGGG 

2341 GAACCCTTCCCGGGTCCCCGGCCCAGCCCCTTCCGGGCCATCCCAGCCCGTCCCGTTCCT 
CTTGGGAAGGGCCCAGGGGCCGGGTCGGGGAAGGCCCGGTAGGGTCGGGCAGGGCAAGGA 



FIGURE 21 (continued, 4 of 6) 



Spl 



E2F NFkB 
********* ********* ******************** 

2401 TTTCCGCGGCCCCGCCCTCTCCTCGCGGCGCGAGTTTCAGGCAGCGCTGCGTCCTGCTGC 
AAAGGCGCCGGGGCGGGAGAGGAGCGCCGCGCTCAAAGTCCGTCGCGACGCAGGACGACG 

hTRTS ' 

******************************> 

2461 GCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCGATGCCGCGCGCTCCCCGCTGCCGAG 
CGTGCACCCTTCGGGACCGGGGCCGGTGGGGGCGCTACGGCGCGCGAGGGGCGACGGCTC 

2521 CCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTGC 
GGCACGCGAGGGACGACGCGTCGGTGATGGCGCTCCACGACGGCGACCGGTGCAAGCACG 

E2F 
******* 

2 581 GGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCG 
CCGCGGACCCCGGGGTCCCGACCGCCGACCACGTCGCGCCCCTGGGCCGCCGAAAGGCGC 



* 

2 641 CGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGCCC 
GCGACCACCGGGTCACGGACCACACGCACGGGACCCTGCGTGCCGGCGGGGGGCGGCGGG 

NFkB 



*************************************** **^*^**^ 

2 701 CCTCCTTCCGCCAGGTGGGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCG 
GGAGGAAGGCGGTCCACCCGGAGGGGCCCCAGCCGCAGGCCGACCCCAACTCCCGCCGGC 

Topo_II_cleavag 



NFkB 
++++++++++ 
NFkB 



Intronl 

*******************************************.**.**********^^.^^ 

2761 GGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGT 
CCCCCTTGGTCGCTGTACGCCTCTCGTCGCGTCCGCTGAGTCCCGCGAAGGGGGCGTCCA 

e site 



2821 GTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAA 
CAGGACGGACTTCCTCGACCACCGGGCTCACGACGTCTCCGACACGCTCGCGCCGCGCTT 

2881 GAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGC 
CTTGCACGACCGGAAGCCGAAGCGCGACGACCTGCCCCGGGCGCCCCCGGGGGGGCTCCG 



2941 CTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAG 
GAAGTGGTGGTCGCACGCGTCGATGGACGGGTTGTGCCACTGGCTGCGTGACGCCCCCTC 



. FIGURE 21 (continued, 5 of 6) 

30 01 CGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGTGCTGGTTCACCTGCTGGC 
GCCCCGCACCCCCGACGACGACGCGGCGCACCCGCTGCTGCACGACCAAGTGGACGACCG 

3061 ACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCC 
TGCGACGCGCGAGAAACACGACCACCGAGGGTCGACGCGGATGGTCCACACGCCCGGCGG 

3121 GCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCCACACGCTAGTGGACCCCG 
CGACATGGTCGAGCCGCGACGGTGAGTCCGGGCCGGGGGCGGTGTGCGATCACCTGGGGC 

3181 AAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCT 
TTCCGCAGACCCTACGCTTGCCCGGACCTTGGTATCGCAGTCCCTCCGGCCCCAGGGGGA 

3241 GGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTT 
CCCGGACGGTCGGGGCCCACGCTCCTCCGCGCCCCCGTCACGGTCGGCTTCAGACGGCAA 

3301 GCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGG 
CGGGTTCTCCGGGTCCGCACCGCGACGGGGACTCGGCCTCGCCTGCGGGCAACCCGTCCC 

3361 GTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCGTGGTTTCTGTGTGGTGTC 
CAGGACCCGGGTGGGCCCGTCCTGCGCACCTGGCTCACTGGCACCAAAGACACACCACAG 

3421 ACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCA 
TGGACGGTCTGGGCGGCTTCTTCGGTGGAGAAACCTCCCACGCGAGAGACCGTGCGCGGT 

3481 CTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCCATCCACATCGCGGCCACC 
GAGGGTGGGTAGGCACCCGGCGGTCGTGGTGCGCCCGGGGGGTAGGTGTAGCGCCGGTGG 

3541 ACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTC 
TGCAGGGACCCTGTGCGGAACAGGGGGCCACATGCGGCTCTGGTTCGTGAAGGAGATGAG 

3601 CTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAGCTCTCTGAGGCCCAGCCT 
GAGTCCGCTGTTCCTCGTCGACGCCGGGAGGAAGGATGAGTCGAGAGACTCCGGGTCGGA 

3661 GACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGG 
CTGACCGCGAGCCTCCGAGCACCTCTGGTAGAAAGACCCAAGGTCCGGGACCTACGGTCC 

3721 GACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCAAATGCGGCCCCTGTTTCT 
CTGAGGGGCGTCCAACGGGGCGGACGGGGTCGCGATGACCGTTTACGCCGGGGACAAAGA 

3781 GGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCC 
CCTCGACGAACCCTTGGTGCGCGTCACGGGGATGCCCCACGAGGAGTTCTGCGTGACGGG 

3841 GCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCGGGAGAAGCCCCAGGGCTC 
CGACGCTCGACGCCAGTGGGGTCGTCGGCCACAGACACGGGCCCTCTTCGGGGTCCCGAG 

3901 TGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCA 
ACACCGCCGGGGGCTCCTCCTCCTGTGTCTGGGGGCAGCGGACCACGTCGACGAGGCGGT 

3 961 GCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTGCCTGCGCCGGCTGGTGCC 
CGTGTCGTCGGGGACCGTCCACATGCCGAAGCACGCCCGGACGGACGCGGCCGACCACGG 

4021 CCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTT 
GGGTCCGGAGACCCCGAGGTCCGTGTTGCTTGCGGCGAAGGAGTCCTTGTGGTTCTTCAA 



FIGURE 21 (continued, 6 of 6) 

4 081 CATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCTGACGTGGAAGATGAGCGT 
GTAGAGGGACCCCTTCGTACGGTTCGAGAGCGACGTCCTCGACTGCACCTTCTACTCGCA 



4141 GCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGTGAGGAGGTGGTGGCCGTCGAGGGCCC 
CGCCCTGACGCGAACCGACGCGTCCTCGGGTCCACTCCTCCACCACCGGCAGCTCCCGGG 

Intron2 

4201 AGGCCCCAGAGCTGAATGCAGTAGGGGCTCAGAAAAGGGGGCAGGCAGAGCCCTGGTCCT 
TCCGGGGTCTCGACTTACGTCATCCCCGAGTCTTTTCCCCCGTCCGTCTCGGGACCAGGA 



4261 CCTGTCTCCATCGTCACGTGGGCACACGTGGCTTTTCGCTCAGGACGTCGAGTGGACACG 
GGACAGAGGTAGCAGTGCACCCGTGTGCACCGAAAAGCGAGTCCTGCAGCTCACCTGTGC 

4 321 GTGATCGAGGTCGAC 
CACTAGCTCCAGCTG 



Figure 22 
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FIGURE 23 

EST AA281296 
(Seq, ID, No. 8) 



gc 

caagttcctg 
tgtcacggag 
caagttgcaa 
ggaagcagag 
cttcatcccc 
cagaacgttc 
cagcgtgcnc 



cactggctga 
accacgtttc 
agcattggaa 
gtcaggcagc 
aagcctgacg 
cgcagagaaa 
aactacgagc 



tgagtgtgta 
aaaagaacag 
tcagacagca 
atcgggaagc 
ggctgcggcc 
agagggccga 
gggcgcg 



cgtcgtcgag 
gctctttttc 
cttgaagagg 
caggcccgcc 
gattgtgaac 
gcgtictcacc 



ctgcncaggt 
taccggaaga 
gtgcagctgc 
ctgctgacgt 
atggactacg 
tcgagggtga 



ctttctttta 
gtgtctggag 
gggacgngtc 
ccagactccg 
tcgtgggagc 
aggcactgtt 



FIGURE 24- 
(Seq. JD. No. 9) 



TCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGArr 
ACCAGCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGr 

CAGCAGTGGCCTCrrCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCAT 

CAGGGGCAAGTC 
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Figure 34 



1 CCCCAAAACC CCAAAACCCC A AAACCCCTA TAAAAAAAGA AAAAATTGAG 
5 1 GTAGTTTAG A AATAAAATAT TATTCCCGCA caaatggaga tggatattga 
101 tttggatgat atagaaaatt tacttcctaa tacattcaac aagtatagca 
I 5 1 gctcttgtag tgacaag aa a gg atgc aaaa cattgaaatc tggctcgaaa 

201 TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAACAATTAG AGTTCTACTT 

25 i CTCGGATGCA AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTTTTAA 

301 AAAGCGGAGA GCAAAGAGTA GAAATTGAAA CATTACTAAT GTTTAAATAA 

3 5 1 AATCAGGTAA TGAGG ATTAT TCTATTTTTT AG ATCAC7TC TTAAGGAGCA 
401 TTATGGAGAA AATTACTTAA TACTAAAAGG TAAACAGTTT GGATTATTTC 

4 5 1 CCTAGCCA AC A ATGATGAGT ATATTAAATT CATATG AGAA TGAGTCAAAG 
501 GATCTCG ATA CATCAGACTT ACCAAAGACA AACTCGCTAT AAAACGCAAG 
551 AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG 

601 TATGGGTTTT attacaattg ttttaggtat cgacggtgaa ctcccgagtc 
65 1 ttgagacaat tgaaaaagct gtttacaact gaaggaatcg cagttctgaa 
701 agttctgatg tgtatgccat tattttgtga attaatctca aatatcttat 
751 ctcaatttaa tggatagcta tagaaacaaa cc aaataaac catgcaagtt 
801 taatggaata tacgttaaat cctttgggac aaatgcacac tgaatttata 

851 TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC 
90 1 TTACAACAGA TTACCTGTTT TGATTACTCT TGCTCATCTC TTATATCTTT 
951 AAAAGAAGCA GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT 
1001 TTGTTGATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA 
i 05 1 AAAGAAG AAG AGCTATCACA ATCCTGATTC TTAAAGATTT CAAAAATTCC 
1101 AGGTAAGAGA GATACATTCA TTAAAATTCA TATATTATAG TTTTTCATTT 
1151 CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA 
[201 GTAAAAAGTa TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT 
1251 TCACATTCAT AGATCGACCTTCATATATCC AATACGATGA TAAGGAAACA 
! 301 GCAGTCATCC CTTTTAAAAA TAGTGCTATG AGGACTAAAT TTTTAGAGTC 
13 51 AAGAAATGGA GCCGAAATCT TAATCAAAAA GAATTGCGTC GATATTGCAA 
1401 AAGAATCGAA CTCTAAATCTTTCGTTAATA AGTATTACCA atcttgattg 

1451 attgaagaga ttgacgaggc aactgcacag aagatcatta aagaaataaa 
1501 gtaactttta ttaattagag aataaactaa attactaata tagagatcag 

1551 CGArCTTCAA TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA 

1 60 1 atacaaacct TGGTCAA AAT ATTGAGGAAG G aaaagaaga ccagttagca 
1651 aaagaaaaaa taaggcaata aataaaatga gtacagaagt gaagaaataa 

1701 AAGATTTATT TTTTTCAATA ATTTATTGAA AAGaGGGGTT TTGGGGTTTT 
1751 GGGGTTTTGG GG 



CCCCAAAACCCCAAAACCCCAAAACCCCTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA 
GCKSGTTTTaGGGTTTTGGGGTTTTGGGGATATTTTTTTCTTTTTTAACTCCM ^ ° 

c PKPQNPKTPIK K EKIEVV* K- 



61 ^T^^I^^I^'^'^^^'^^'^'^^^^'^'^'^'^^ATATTGATTTGOATGATATAGAAAATT 

■* h 4- 

^CCTACTATATCTTTTAA 



TTATTTTATAATAAGGGCGTGTTTACCTCTACCTATAACTAAAc"- ^" "^^^ 



b \ \ ^ "^o ^n, ^ ^ ^ ^ ^ ^ W M ^ * K I - 

n ■^KYYSRTNGDGY*FG*yRKF 

TACTTCCTAATACATTCAACAAGTATAGCAGCTCTTGTAGTGACAAGAAAGGATGCAAAA 
l^X H J. ^ ^ _^ ion 

ATGAAGGATTATGTAAGTTGTTCATATCGTCGAGAACATCACTGTTCTTTCCTACGTTTT 

t ^^^^^^^^S'^SIAALVVTRKDAK 

b T S*YIQQV*QLL*CQERMQ N- 

C J^PNTFNKYSSSCSDKKGC KT- 

^^'^'TGAAATCTGGCTCGAAATCGCCTTCATTGACTATTCCAAAGTTGCAAAAACAATTAG 
XSX — — — f ^ ^ ^ ^ 240 

GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACGTTTTTGTTAATC 

a H*NLiAR NRLH*LFQS CKNN* 

b ^EIWLEIAFIDYSKVAKTXR- 

C ^KSGSKSPSLTIPKLQKQLE- 

AGTTCTACTTCTCGGATGCAAATCTTTATAACGATTCTTTCTTGAGAAAATTAGTTTTAA 

tcw^gatgaagagcctacgtttagaaatattgctaagaaagLvctcttttLvtcaaa^ 

a SSTSRMQIFITXLS*EN* F* 

^L<5CKSr,* RFFL EKISFK- 
<= ^^fsdanlyndsflrklvlk- 

301 ^^^5^^°^°^-'^*^A'^'rA<3AAATTGAAACATTACTAATGTTTAAATAAAATCAGGTAA 

tttcgcctctcgtttctcatctttaactttgtaatgattacaaatttattttaot^ 

b W \ * ^ L K H Y * C L N K I R * - 

o sgeqrveietl lmfk*nq v m- 

3 61 '^t'^tZZt'^'l^tV^^ 

actcctaataagataaaaaatctagtgaagaattcctcgtaatacctot ° 



c 



E D Y S T ^ °. « ^ ^ ^ S I M E K I T * - 
\ T T r p. ^ ^ ^ * G A L M R K L L N - 

Riilpprsllkehygenyli- 



F/G. 35 



TACTAAAAGGTAAACAGTTTGGATTATTTCCCTAGCCAACAATGATGAGTATATTAAATT 
421 4- + + + + 

ATGATTTTCCATTTGTCAAACCTAATAAAGGGArCGGTTGTTACTACTCATATAA'TTTAA 

a Y*KVNSLDyFPSQQ**VY*T 

b TKR*TV WIISLANNDEYIKF'- 

c LKGKQFGLFP*PTMMSILNS- 

481 

GTATACTCTTACTC AGTTTCCTAGAGCTATGTAGTCTGAATGGTTTCTGTTTGAGCGAT^ ^ ^ 

a HMRMSQRISIHQTYQRQTRY 

b I*E*VKGSRyiRLTK DKLAI- 

C YENESKDDDTSDLPKTNSL*- 

^ ^ aaaacgcaagaaaaagtttgataatcgaacagcagaagaacttattgcatttactattcg 
ttttgcgttctttttcaaactattagcttgtcgtcttcttgaataacgtL^tgataagc ^ ^ ^ 

a KTQEKV**SNSRRTYCIYYS 

b KRKKKFDNRTAEELIAFTI R- 

c NARKSLIIEQQKNLLHLLFV- 



tatgggttttattacaattgttttaggtatcgacggtgaactcccgagtcttgagacaat 
atacccaaaataatgttaacaaaatccatagctgccacttgagggctcagaactctgtta 



a yGFYYNCFRyRR*TPES*DrT 

b mgfitivlgidgelpsleti- 
c wvllqlf*vstvnsrvlrql- 

tgaaaaagctgtttacaactgaaggaatcgcagttctgaaagttctgatgtgtatgccat 

661 + + + ^ ^ ^ 

actttttcgacaaatgttgacttccttagcgtcaagactttcaagactacacatacggta 



i *ksclqlkesqf*kfccvgh 

> ekavyn*rnrssessdvyai- 

: KKXiFTTEG IAVLKVLMC MP L- 



tattttgtgaattaatctcaaatatcttatctcaatttaatggatagctatagaaacaaa 

•* ( + 4-- 1 + 

ataaaacacttaattagagtttatagaatagagttaaattacctatcgatatctttgttt 



YFVN^'SQISYLNLMDSYRNK 
IL *INLKYLISr *WIAIErN- 
FCELISNILSQFNG*L*KQT- 

CCATiATAAACCATGCAAGTTTAATGGAATATACGTTAAATCCTTTGGGACAAATGCACAC 

781 + + ^ ^ ^ g^Q 

GGTTTATTTGGTACGTTCAAATTACCTTATATGCAATTTAGGAAACCCTGTTTACGTGTG 

PNKPCKFNGIYVKSFGTNAH 
QINHASLMEYTLNPLGQMHT- 
K* TMQV*WNIR* ILWDKCTL- 

TGAATTTATATTGGATTCTTAAAGCATAGATACACAGAATGCTTTAGAGACTGATTTAGC 
841 + + + + + + 9Q0 

acttaaatataacctaagaatttcgtatctatgtgtcttacgaaatctctgactaaatcg 
*iyigflkhrytecfrd*fs 

EFIIiDS*SlDTQNALETDLA- 
NLYWILKA * IHRML *RLX *L- 

FiG. 35 

(CONTINUED) 



TTACAACAGATTACCTGTTTTGATTACTCTTGCTCATCTCTTATATCTTTAAAAGAAGCA 

901 + + + + -t- 960 

AATGTTGTCTAATGGACAAAACTi^TGAGAACGAGTAGAGAATATAGAAATTTTCTTCGT 

a LQQITCFDYSCSSLISLKEA 

b YNRLPVLITLAHLLYL^KKQ- 

c TTDYLF*LI.LLISYIFKRSR- 

GGCGAAATOAAAAGAAGACTAAAGAAAGAGATTTCAAAATTTGTTGATTCTTCTGTAACC 

961 -t- 4- -t- + + + 1020 

CCGCTTTACTTTTCTTCTGATTrCTTTCTCTAAAGTTTTAAACAACTAAGAAGACATTGG 

a G EMKRRLKKEISKFVDSSVT 

b AK*KED*RKRFQNLLXLL*P- 

C RNEKKTKERDFKIC*FFCNR- 

GG7VATTAACAACAAGAATATTAGCAACGAAAAAGAAGAAGAGCTATCACAATCCTGATTC 

1021 + + + 4- 4- 1080 

CCTTAATTGTTGTTCTTATAATCGTTGCTTTTTCTTCTTCTCGATAGTGTTAGGACTAAG 

a GINNKNISNEKEEELSQS*F 

b ELTTRXLATKKKKSYHNPDS- 

c N*QQEY*QRKRRRAITILIL- 

TTAAAGATTTCAAAAATTCCAGGTAAGAGAGATACATTCATTAAAATTCATATATTATAG 

1081 + 4. 4- + 4. 4- 1140 

AATTTCTAAAGTTTTTAAGGTCCATTCTCTCTATGTAAGTAATTTTAAGTATATAATATC 

a liKISKIPGKRDTFIKIHIL* 

b *RFQKFQVREXHSLKFIYYS- 

c KDFKNSR*ERYIH*NSYIIV- 

TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAATATTTTTTGATTAGCTGGAA 

1141 + 4- 4- + + 4- 1200 

AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 

a FFISQ LLFSFILTIFFD*IiE 

b FSFHSCYFLLS*QYFL,ISWK- 

C FHFTAVIFFYLNNIF*LAGS- 

GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACTTAGCTTATTCACATTCAT 

1201 H- 4- 4- : + 4. 4- 1260 

CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCCATTGAATCGAATAAGTGTAAGTA 

a VKSIK*EKR^TEVT*L»rHIH 

b *KVSNKRSARLR*LSLFTFI- 

c KKYQIREALD*GNLAYSHS*- 

AGATCGACCTTCATATATCCAATACGATGATAAGGAAACAGCAGTCATCCGTTTTAAAAA 

1261 4. 4- 4- 4- 4 + 1320 

TCTAGCTGGAAGTATATAGGTTATGCTACTATTCCTTTGTCGTCAGTAGGCAAAATTTTT 

a RSTFIYPIR**GNSSHPF*K 

b DRPSYIQYDDKETAVIRFKM- 

c IDLHISNTMIRKQQSSVLKI- 

TAGTGCTATGAGGACTAAATTTTTAGAGTCAAGAAATGGAGCCGAAATCTTAATCAAAAA 

1321 4. + 4. 4-^ 4- + 1380 

ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGGCTTTAGAATTAGTTTTT 

a *CYED*XFRVKKWSRNLNQK 

b SAMRTKF LESRNGAEILIKK- 

c VIi*GLNF*SQEMEPKS*SKR- 



F/G- 35 

(CONTINUED) 



13 81 ^TJ?^!^!!? ft^^^^^^'^'^'^^^TCTTTCCTTAATAAGTATTACCA 

cttaacgcagctataacgttttcttIgcttgaga^otagI^gc^^^ 

b N CVDlAKESNS KSFVNKTfY O 

IAS I L Q K N R T L N s L I S I T : 

1441 ^!'5!T?™J™°^?^fJ^T*^^^^^^C^°CACAC3AAGATCATTAAAGAAATAAA 

TAGAACTAACTAACTTCTCTAACTGCTCCGTTGACGTGTCTTCTAGTAAT^ ^ ^ ° ° 

c ^ T / „^ ° E ^ A Q K I I K E I K - 

1501 ^^^f^^^^JI^^^^^f^^^'T^CTAAATTACTAATATAGAGATCAGCGATCTTCAA 

CATTGAAAATAATTAATCTCTTATTTGATTTAATGATTATATCTCTAGTC^ ^ ^ ^ ° 

a V T F IN*RIN*ITNIEISDLQ - 
b *^^I'LIRE*TKLLr*RSAIF N- 

c NFY*LENKLNY*yRDQRSSI- 

TTGACGAAATAAAAGCTGAACTAAAGTTAGACAATAAAAAATACAAACCTTGGTCAAAAT 
1561 + + ^ ^ _^ ^ 

AACTGCTTTATTTTCGACTTGATTTCAATCTGTTATTTTTTATGTTTGGAACCAGTTTTA 

a I' T K*KLN*S*TIKNTNLG QN 

fa *RNKS*TKVRQ*KIQTLV K I- 

<= °EIKAELKLDNKKYKPWSKY- 

-, <: n , A'^'^^^'^^^AAAAGAAGACCAGTTAGCAAAAGAAAAAATAAGGCAATAAATAAAATGA 

lo21 — — — — — — -(.—_— 1 *ron 

TAACTCCTTCCTTTTCTTCTGGTCAATCGTTTTCTTTTTTATTCCGTTATTTATTTTA^ 

a IEEGKE DQLAKEKIRQ*XK* 

c *GRKRRPVSKRKNKAINKMS- 

^'^^^^^^QTGAAGAAATAAAAGATTTATTTTTTTCAATAATTTATTGAAAAGAGGGGTT 
1681 + ^ ^ ^ ^ ^ ^^^^ 

CATGTCTTCACTTCTTTATTTTCTAAATAAAAAAAGTTATTAAATAACTTTTCTCCCCAA 

a VQK*RNKRFIFFNNLLKRGV 

Y RSEEIKDLFFSIXY*KEGF- 
c TEVKK*KIYFFQ*FIEKRGF- 

TTGGGGTTTTGGGGTTTTGGGG 
1741 + 1762 

AACCCCAAAACCCCAAAACCCC 

a LGFWGFG 
b WGFGVLG 
c GVLGFW 
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2 EVD\-Z-iOADNHCIHSALKTCEEIKEAKTLYSWIOKVIRCRNOSOSHYKDL SI 
i: *: .::!: :| • I t I l-I 

L9 ELELly.OENONDlOVRVK. . . I DDPKQY . . LVNVTAACLLOEGSYYODK 6 2 

52 £Dr<--AOTNIVATPRDYNEEDFKVIAKKEVr.STCLMIECIDKCLVELL LOO 

63 OEKSVIITKALL . . . EVAESDPEFICOLAVYIRNELYlRTTTNYI VAF . 107 

■'^l SSSDVSDROKLOCrCFOLKCNOLAKTHLLTALSTOKOYFFODEWNQVRAM 150 

0 8 CWHKNTOPFIEKYFNKAVLLPNDLLEVCEFAOVLYt 14 4 

• 3 1 ICN"- rRHLYTKYLIFQRTSECTLVOFCGNNVFDHLKVNDKFDKKOKGGA 2 00 

14 S FOATZFKNLY LDR ILSODIRKELTFRKCLQRCVRSKF 181 

20 X AOHN- PRCCSTCKYNVKNEKOHFLNNINVPNWNNMKSRTRIFYCTHF 24 7 

182 SEFKEYOLCKYCTES. .QRKKTMFRYLSVTNKOKWOOTKKK 220 

248 NRN'NO-rKKHEFVSNKNNrSAMDRAOTIFTMIFRFNRIRKKLKOKVIEKl 297 
|. .1 : I .r : :|:..: I |1- t;.il 

2 21 . RKENLLTKLOAI KESEDKSKRETC D IMNVEDAIKALKPAVMKKI 2 64 

298 AYMLEKVKDFNFNYYLTKSCPLPENWRERKOKIEMLINKTREEKSKYYEE 34 7 

11 1 M i l 
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aactcatna attactaan taatcaacaa gattgataaa aagcagtaaa taaaacccaa 
tagatttaat ttagaaagta tcaaugaaa aatggaaatt gaaaacaact aagcacaata 
1 1 gccaaaagcc gaaaaattgt ggtgggaact tgaattagag atgcaagaaa accaaaatga 
II tatataagn agggttaaga rtgacgatcc taagcaatat ctcgtgaacg tcactgcagc 
1 1 atgtttgTTg taggaaggta gtiactacta agataaagat gaaagaagat atatcatcac 
) 1 taaagcacn cttgaggtgg ctgagtctga tcctgagttc atctgctagt tggcagtcta 
i 1 catccgiaat gaactttaca tcagaactac cactaactac attgtagcat tttgtgttgt 
1 1 ccacaagaal actcaaccat tcatcgaaaa gtacttcaac aaagcagtac ttttgcctaa 
! 1 tgacttactg gaagtctgtg aaittgcata ggttctctat atttttgatg caactgaatt 
i I caaaaatng tatcngata ggatacntc ataagatan cgtaaggaac tcactttccg 
) 1 taagtgma caaagatgcg tcagaagcaa gttttctgaa ttcaacgaat actaacngg 
i I taagtangc actgaatcct aacgtaagaa aacaatgttc cgttacctct cagttaccaa 
\ \ caagtaaaag tgggattaaa ctaagaagaa gagaaaagag aatctcttaa ccaaacttta 
; I ggcaaiaaag gaatctgaag ataagtccaa gagagaaact ggagacataa igaacgnga 
1 1 agatgcaatc aaggcinaa aaccagcagt latgaagaaa atagccaaga gatagaatgc 
) I catgaagaaa cacatgaagg cacctaaaat tcctaactct accttggaat caaagtactt 
) 1 gaccncaag gatctcatta agttctgcca tatttctgag cctaaagaaa gagtctataa 
[ I gatccnggt aaaaaatacc ctaagaccga agaggaatac aaagcagcct ttggtganc 
; 1 tgcatctgca cccttcaatc ctgaattggc tggaaagcgt atgaagattg aaatctctaa 
\ 1 aacatgggaa aatgaactca gtgcaaaagg caacactgct gaggtttggg ataatttaat 
) 1 ttcaagcaat taactcccat atatggccat gttacgtaac ngtctaaca tcttaaaagc 
i I cggtgtnca gatactacac actctattgt gatcaacaag atttgtgagc ccaaggccgt 
[1 tgagaactcc aagatgncc ctcncaatt ctttagtgcc attgaagctg ttaatgaagc 
E I agnactaag ggancaagg ccaagaagag agaaaatatg aatcnaaag gtcaaatcga 
\ 1 agcagtaaag gaagngttg aaaaaaccga igaagagaag aaagatatgg agttggagta 
1 1 aaccgaagaa ggagaatng rtaaagtcaa cgaaggaan ggcaagcaat acattaactc 
> ! cangaacrt gcaatcaaga tagcagrtaa caagaattta gatgaaatca- aaggacacac 
[ 1 tgcaatcnc tctgatgm ctggnctat gagtacctca atgtcaggtg gagccaagaa 
; 1 giatggncc gncgtacn gtctcgagtg tgcanagic cnggmga tggtaaaata 

I acgngtgaa aagtcctcat ictacatcn cagrtcacct agnctcaat gcaataagtg 
) I ttacnagaa gngatctcc ctggagacga actccgicct tctatgtaaa aacrtttgca 
;i agagaaagga aaacnggtg gtggtactga mcccctat gagigcang atgaatggac 
[1 aaagaaiaaa actcacgtag acaatatcgt latmgtct gatatgatga ngcagaagg 
[ I atancagat atcaatgtta gaggcagnc cattgnaac agcatcaaaa agtacaagga 
\ I tgaagiaaat cctaacatta aaatctttgc agttgactta gaaggnacg gaaagtgcct 
1 1 taatctaggt gatgagttca atgaaaacaa ctacatcaag atattcggta tgagcganc 
1 1 aatcttaaag ncatncag ccaagcaagg aggagcaaat atggtcgaag ttatcaaaaa 
[ I ctngcccn caaaaaaiag gacaaaagtg agtttcttga gancttcta taacaaaaat 
[ 1 ctcaccccac trttttgtn tangcatag ccanatgaa atnaaaaa ttatctartt 
i I atnaagtta cttacatagt rtatgtatcg cagtctatta gcctancaa atganctgc 
► I aaagaacaaa aaagattaaa a 
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MEIEKNQAQQPKAEKLWWELELEMQENQNDIQVRVKJDDPKQYL 
VNVTAACLLQEGSYYQDKJDERJlYirTK^LLEVAESDPEFICQLAVYIIWELYIRTTTN 
YFVAFCYVHKNTQPFIEKYFNKAVLLPNDLLEVCEFAQVLYIFDATEFKNLYLDRILS 
QDIRKELTFRXCLQRC VRS KJ^S EFNEYQLG KYCTESQRKJCTMFRYLS VThnCQKWDQTK 
KJ<jRKJENLLTKLQAFKJESEDKSKR£TGDIMhrVEDAlKALKPAVM^^ 

APKJPNSTLESKYLTFKDLIKJ^CHISEPKERVYKJLGICKYPKTEEEYICAAFGDSASAP 

FNPELAGKJlMKJEISKT\VnENELSAKGNTAEVWDNLISSNQLPYMAMLR>n:.Sm^ 

SDTTHSrVTNKICEPKAVENSKMFPLQFFSAIEAVNEAVTXGFKAKKRENiv^ 

AV!<XVA^KTDEEKJ<JDMELEQTEEGEFVKVT4EGIGKQYrNSIELAlKL^V>nCNLDEr^^ 

HTAJFSDVSGSMSTSMSGGAKKYGSVRTCLECALVLGLMVKQRCEKSSFYIFSSPSSQ 

CNKCYLEVDLPGDELRPSMQKJLLQEKGKLGGGTDFPYECIDEV*TTC>nCTHVDNIVILSD 

N^lAEGYSDrNVRGSSI\^SIKKY!CDEVNPNIKJFAVDLEGYGKCLNLGDEFNENNYI 

KJFGMSDSILKflSAKQGGANMVEVlKNFALQKJGQK 
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I tcaatactai taatmtaa ataaaaaaaa gcaaactaca aagaaaatgt caaggcgtaa 
6 1 ctaaaaaaag ccataggctc ctataggcaa tgaaacaaat cttgattttg tattacaaaa 
1 2 1 ictagaagtt tacaaaagcc agartgagca ttataagacc tagtagtaat agatcaaaga 
1 8 1 ggaggatctc aagctrttaa agttcaaaaa ttaagattag gatggaaact ctggcaacga 
24 1 tgatgatgat gaagaaaaca actcaaataa ataataagaa ttattaagga gagtcaatta 
301 ganaagtag caagtrtaat tgataaaaaa agttggnct aaggtagaga aagatttgaa 
361 tttgaacgaa gatgaaaaca aaaagaatgg actttctgaa tagcaagtga aagaagagta 
421 attaagaacg attactgaag aataggttaa gtanaaaat ttagtattta acatggacta 
.481 ccagttagat ttaaatgaga gtggtggcca lagaagacac agaagagaaa cagattatga 
54! tactgaaaaa tggtttgaaa tatctcatga ccaaaaaaat tatgtatcaa tttacgccaa 
60 1 ctaaaagaca tcatangn ggtggcttaa agattatttt aataaaaaca attatgatca 
66 1 tcttaatgta agcattaaca gactagaaac tgaagccgaa ttctatgcct ngatgattt 
72 1 ttcacaaaca atcaaactta ctaataatic ttactagacL.gttaacatag acgttaattt 
781 tgataataat ctctgtatac tcgcattgct tagatrttta ttatcactag aaagattcaa 
84 1 tatmgaat ataagatctt cttatacaag aaattaaut aattttgaga aaattggtga 
901 gctacngaa actatcncg cagngtcn nctcatcgc cacttacaag gcattcattt 
961 acaagncct tgcgaagcgt tctaatattr agttaactcc tcatcataaa ttagcgttaa 
1 02 1 agatagciaa ttataggtat actcmctc tacagactta aaattagttg acactaacaa 
1081 agtccaagat tattttaagt tcttataaga attccctcgt ttgactcatg taagctagta 
1141 ggctatccca gttagtgcta ctaacgctgt agagaacctc aatgttttac ttaaaaaggt 
1201 caagcatgct aatcttaan tagmctat ccctacctaa ttcaattttg atttctactt 
1 26 1 tgttaarrta taacatttga aattagagtt tggattagaa ccaaatattt tgacaaaaca 
1321 aaagcttgaa aatctacttt tgagtataaa ataatcaaaa aatcttaaat ttttaagatt 
138! aaacttttac acctacgttg cttaagaaac ctccagaaaa cagatattaa aacaagctac 
1 44 1 aacaatcaaa aatctcaaaa acaataaaaa tcaagaagaa actcctgaaa ctaaagatga 
1501 aactccaagc gaaagcacaa gtggtatgaa attttttgat catctttctg aattaaccga 
1561 gcttgaagat ncagcgna acttgtaagc tacccaagaa arttatgata gcngcacaa 
1 62 1 accTTtgatt agatcaacaa amaaagaa gttcaaatta agttacaaat atgaaatgga 
1681 aaagagtaaa atggatacat tcatagatct taagaatatt tatgaaacct taaacaatct 
i 74 1 taaaagaigc tctgnaaia tatcaaatcc tcatggaaac atttcttatg aactgacaaa 
1 80 1 taaaganct acrmtata aatnaagci gaccnaaac taagaattat aacacgctaa 
1861 gtatacrm aagtagaacg aatmaan laataacgtt aaaagtgcaa aaangaaic 
i 92 1 nccfcana gaaagcnag aagatanga tagtcmgc aaaictattg cncngtaa 
1981 aaamacaa aatgttaata natcgccag mgctctat cccaacaata tnagaaaaa 
204 i tcctncaat aagcccaatc ttctattm caagcaam gaataanga aaaatttgga 
2101 aaatgtatci atcaactgta ncngatca gcaiatactt aanctatn cagaanctt 
2161 agaaaagaat aaaaaaataa aagcattcat mgaaaaga tattatttat tacaatatta 
2221 tcttganat actaaattat naaaacact tcaaiagna cctgaattaa attaagttta 
228 1 canaanag caanagaag aangactgi gagtgaagia cataagtaag tatgggaaaa 
234 1 ccacaagcaa aaagctttct atgaaccan atgtgagttt atcaaagaat catcctaaac 
2401 cctttagcta atagatmg accaaaacac tgtaagtgal gactctatta aaaagatttt 
2461 agaatctata tctgagtcta agtatcatca natttgaga ngaacccta gttaatctag 
2521 cagtnaatt aaalctgaaa acgaagaaat ttaagaactt ctcaaagctt gcgacgaaaa 
2581 aggtgmta gtaaaagcat actataaan cccictatgl ttaccaactg gtacttatta 
264 1 cganacaat tcagatagat ggtgattaat laaatanag Otaaataaa tanaaaiat 
270 1 tgaatatnc tngcnatt amgaataa tacatacaat agtcaitttt agtgmtga 
2761 atatamta gnatnaat tcanatiu aagtaaataa nairmca atcaimn 
2821 aaaaaatcg 
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MSRRNQIOCPQAPIGNETNLDFVLQNLEVYKSQIEHYKTQQQQIK. 

EEDLKLLKPKNQDQDGNSGNDDDDEENNSNKQQELLRRVNQIKQQVQLIKKVGSKVEK 

DLNLNEDENKKNGLSEQQVKEEQLRTITEEQVKYQNLVFNMDYQLDLNESGGHRRHRR 

ETpYDTEKWEISHDQKNYVS[YANQKTSYCWWLia3Y™KJWYDHLNVSINRLETEAE 

FYAFDDFSQTIKJ.TNmSYQTVNrDVNFDNNLCILALLRFLLSLERFNILNIRSSYTRN 

QYNFEKIGELLETIFAVVFSHRHLQGIHLQVPCEAFQYLVNSSSQISVKDSQLQVYSF 

STDLKLVDTNKVQDYFKfLQEFPRLTHVSQQAIPVSATNAVE>n-ma.LKJCVKHANLNL 

VSIPTQFNFDFYFVNLQHLKJLEFGLEPNILTKQKJLENLLLSIKQSKNLKFLRLNFYTY 

VAQETSRKQILKQATTIKNLKJ^WKNQEETPETKJDETPSESTSGMKFFDHLSELTELED 

FSWLQATQEIYDSLHia.LIRSTNLKJ<PKI.SYKYEMEKSKJVlDTPIDLKNrYETLNNLK 

RCSVNISNPHGNISYELTNKJDSTFYKJ^KXTLNQELQHAKYTFKQNEFQFNNVKSAKJE 

SSSLESLEDIDSLCKSIASCKNLQNVNIIASLLYPNfNIQKNPFNKPNLLFFKQFEQLK 

NLENVSINCILDQHILNSISEFLEKNKLKIKAFILKRYYLLQYYLDYTKLFKTLQQLPE 

LNOVYfNOQLEELTVSEVHKOVVVENHKQKAFYEPLCEFIKESSQTLQLIDFDQNTVSD 

DSIKKJLESISESKYHHYLRLNPSQSSSLIKSENEEIQELLKACDEKGVLVKAYYKFP 
LCLPTGTYYDYNSDRW 
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MKILFEFIQDKLDIDLQTNSTYKJENLKCGHFNGLDEILTTCFAL 

PNSRjaALPCLPGDLSHKAVIDHCIIYLLTGELYN>A/LTFGYKIARNEDVNNSLFCHS 

ANVhTVTLLKGAAWKMFHSLVGTYAFVDLLlNYTVIQFNGQFFTQIVGNRCNEPHLPPK 

^^^QRSSSSSATAAQIKQLTEPVTNKQFLHKJLNrNSSSFFPYSKJLPSSSSIKKLTDLR 

EAIFPTNLViaPQRLKVRJNLTLQKi.LI<JlHKRLNYVSILNSICPPLEGTVLDLSHLSR 

QSPK£RVLKJIIV1LQKJLLPQEMFGSKK>JKGKJIKNLNLLLSLPLNGYLPFDSLLKJ<JL 

RLKDFRW1.FISDIWFTKHNFENLNQLAICFISWLFRQLIPKIIQTFFYCTEISSTVTI 

VYFRHDT\VTvfKLITPFIVEYFKTYLVE>WVCRNHNSYTLSNFNHSKJvlJRJIPK^ 

IIAIPCRGADEEEFTIYK^NHKNAIQPTQKJLEYLRNKilPTSFTKrYSPTQIADRJlCE 

FKQRLLKKfNNVLPELYFMKFDVKSCYDSIPRMECMRILKDALKNENGFFVRSQYFFN 

TNTGVLKJLF^^VVNASRVPKJ'YELYID^n^RTVI^LSNQDV^NVVE^^EIFKTALWEDK^ 

IREDGLFQGSSLSAPtVDLVYDDLLEFYSEFFCASPSQDTLILKLADDFLIISTDQQQV 

INIKKLAMGGFQKYNAKANRJDIGLAVSSQSDDDTVIQFCAMmFVnECELEVWKHSSTN^ 

NFHIRSKSSKGIFRSLIALFNTRJSYKTIDTNLNSTNTVLMQIDHVVKNISECYKSAF 

KDLSfNVTQNMOFHSFLQRIlEMTVSGCPITKCDPLIEYEVRFTILNGFLESLSSNTS 

KFKJDNIiLLRKEIQHLQAYIYIYIHIVN 



Figure 47 



Oxytncha LCVS YILSSFYYANLEENALQFLRKJESMDPEKPETNLLMRLT 
Euplotes LCVSSILSSFYYATLEESSLGFLRJDESVfNPENPNVNLLMRLT 




Figure 48 



ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 
GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 
TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 
TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 
ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 
TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 
GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 
TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 
GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 
ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 
TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 
GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 
TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 
GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 
ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTACCCCA 
AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 
AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 
GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 
GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 
ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 
TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 
CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 
AATGAGGATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGCCATCCCATGCAG 
AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC 
CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 
TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 
TAAT7VATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 
CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 
TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 
TGTTAACGCTAGCAGAGTACC7VAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 
TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 

TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 
CAGTCCTAGCCAGGAGACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 
AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 
TGCGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGATACGGT 
T ATTCAATTTTGTG CAATG CACAT ATTTGTTAAAG AATTGGAAGTTTGGAAACATTCAAG 
CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 
AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 
CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 
TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 
CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 
TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 
ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 



Figure 49 



AKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKXQSIGIRQHLKR 

VQLRDVSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRKEKR 

AERLTSRVKALFSVLNYERA 



Figure 50 



GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 
TTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACC 
GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAA 
GAGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGA 

agccaggcccgccctgctgacgtccagactccgcttcatccccaagcctgacg 
ggctgcggccgattgtgaacatggactacgtcgtgggagccagaacgttccg 
cagagaaaagagggccgagcgtctcacctcgagggtgaaggcactgttcagc 
gtgctcaactacgagcgggcgcg 



Figure 5 1 



MTEHHTPKSRJLRFLENQYVYLCTLNDYVQLVLRGSPASSYSNICERJLRSDVQTSFSIFLHSTVVGF 
DSKPDEGVQFSSPKCSQSELlANVVKQMFDESFERRRNLLMKGFSNfNHEDFRAMHWGVQNDLV 
STFPNYLISILESKNWQLLLEIIGSDAMHYLLSKGSIFEALPNDNYLQISGIPLFKKNVFEETVSKKRK 
RTIETSITQNKSARJ<£VSWNSISISRFSIFYRSSYKXFKQDLYFNLHSICDRJ^ITVK^ 

LfNAFQVKQLHKVIPLVSQSTVVPKRLLKVYPLIEQTAKRLHRJSLSKVYNHYCPYIDTHDDEKlLS 

YSLKPNQVFAFLRSILVRVFPKLIWGNQRJFEHLKDLETFLKLSRYESFSLHYLMSNIKISEIEWLVL 

GKJISNAKJ^CLSDFEKJIKQIFAEFIYWLYNSFIIPILQSFFYITESSDLRNRTVYFRXDIWK^ 

SMKMEAFEKINEbnWRAiDTQKTTLPPAVIRLLPKXNTFRLITNLRJO^ 

LRPVASiLfaiLI>JEESSGIPFNLEVYMKLLTFKJ<DLLKliRMFGRKXYFVRIDIK^ 

IVKKJ<XKJDPEFVfRKYATIHATSDRATKNFVSEAFSYFDMVPFEKVVQLLSMKTSDTLFVDFVDY 

V/TFCSSSEIFKMLKEHLSGHIVKIGNSQYLQKVGIPQGSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 

LRVVDDFLFITVNKKDAKJO^LNLSLRGFEKJTNFSTSLEKTVINFENSNGinW 

FSVNMRSLDTLLACPKIDEALFNSTSVELTFCHMGKSFFYKILRSSLASFAQVFIDITHNSKFNSCCNI 

YRLGYSMCMRAQAYLKRMKDIFIPQRMFITDLLNVIGRJCIWKKLAEILGYTSRRFLSSAEVKWLFC 

LGMRDGLKPSFKYHPCFEQLIYQFQSLTDLIKPLRPVLRQVLFLHRRIAD 




Figure 52 

ggtaccgamacmccmcncaiaagciaatigcitcctcgaacgctccmatctciggaaataninacaagaactcaataacaataccaagicaaanccaatatgaagg 

tguanagtgatcgataatamcrattttatcggicgnaccaagtataaggacaaaaagaacaacnccnccccctaaagacrmacmanaatnacnttca 

ggncgcnacrmaatcgtggtactgtmagcigctacttctagccaaccgcgtgmctaccccgtcanggatatagctcnggagtagctcacagaaatccttacaaat^^ 

ctgatgagactatanagancattacagiccgtgcatattcnaacaiggagccttacacmagatgagtcacgtcgcatgatggagtamggtatcatccaacgtngccng 

aaaaggngataattatngcaaaatcaigtccnagtggtggtaatccgcgaaagttmtgatgcngcacacgtctagcatgangagatanc^^ 

ctccmaacgcggTtTtarrmctatmctattctcatgngUccaaataigtatcatctcgtattaggctttmccgtmact^^ 

aataatctaaanagmcgcnataangatagtagiagaaaganggtganctactcgtgmatgnattagmaaagatacmgcaaaacamatiagctatcattatataaaa 

aaaatcctataanataaatattaatcaatamgcggtcactatrtatttaaaacgnatgatcagtaggacacmgcatatatatagnaigcnaatggtiacngm 

GACCGAACACCATACCCCCAAAAGCAGGATTCTTCGCTTTCTAGAGAATCAATATGTATACCTATGTA 

CCTTAAATGATTATGTACAACTT6TTTTGAGAGGGTCGCCGGCAAGCTCGTATAGCAATATATGCGAA 

CGCTTGAGAAGCGATGTACAA ACGT CCTTTTCTATTTTTCTTCATTCGACTGTAGTCGGCTTCGACAGT 

AAGCCAGATGAAGGTGTTCAATTTTCTTCTCCAAAATGCTCACAGTCAGAGgtatatatartragttng 

ggatagctaatatatgggcagCTAA TAGC GAATGTTGTAAA^CAGATGTTCGATGAAAGTTTTGAGCGTCGAAGGA 

ATCTACTGATGAAAGGGTTTTCCATGgtaaggtanctaangtgaaatamacctgcaatiactgtncaaagagangtamaaccgataaagAA 

TCATGAAGATTTTCGAGCCATGCATGTAAACGGAGTACAAAATGATCTCGTTTCTACTTTTCCTAATTA 

CCTTATATCTATACTTGAGTCAAAAAATTGGCAACTTTTGTTAGAAATgtaaataccggnaagatgngcgcacmgaaca 

agactgacaagtatagTATCGGCAGTGATGCCATGCATTACTTATTATCCAAAGGAAGTATTTTTGAGGCTCTTC 

CAAATGACAArrACCTTCAGATTTCTGGCATACCACTTTTTAAAAATAATGTGTTTGAGGAAACTC 

CAAAAAAAAGAAAGCGAACCATTGAAACATCCATTACTCAAAATAAAAGGGCCCGCAAAGAAGTTTC 

CTGGAATAGCATTTCAATTAGTAGGTTTAGGATTTTTTACAGGTCATCCTATAAGAAG^ 

aactaatactgnatccncataactaarmagATCTATATTTTAACTTACACTCTATTTGTGATCGGAACACAGTACACATG 

TGGCTTCAATGGATTTTTCCAAGGCAATTTGGACTTATAAACGCATTTCAAGTGAAGCAATTGCACAA 

AGTGATTCCACTGGTATCACAGAGTACAGTTGTGCCCAAACGTCTCCTAAAGGTATACCCTTTAATTGA 

ACAAACAGCAAAGCGACTCCATCGTATTTCTCTATCAAAAGTTTACAACCATTATTGCCCATATATTGA 

CACCCACGATGATGAAAAAATCCTTAGTTATTCCTTAAAGCCGAACCAGGTGTTTGCGTTTCTTCGATC 

CATTCTTGTTCGAGTGTTTCCTAAATTAATCTGGGGTAACCAAAGGATATTTGAGATAATATTAAAAGg 

langtataaaacnanaccactaacgamtaccagACCTCGAAACTTTCTTGAAATTATCGAGATACGAGTC 1 T I r AGTTTAC 

ATTATTTAATGAGTAACATAAAGgtaatatgccaaatrtttttaccanaattaacaatcagATTTCAGAAATTGAATGGCTAGT 

CCTTGGAAAAAGGTCAAATGCGAAAATGTGCTTAAGTGATTTTGAGAAACGCAAGCAAATATTTGCGG 

AATTCATCTACTGGCTATACAATTCGTTTATAATACCTATTTTACAATC i 1 i i 1 1 1 ) ATATCACTGAATC 

AAGTGATTTACGAAATCGAACTGTTTATTTTAGAAAAGATATTTGGAAACTCTTGTGCCGAC^ 

TACATCAATGAAAATGGAAGCG i 1 1 GAAAAAATAAACGAGgtannaaagtarttmgcaaaaagctaatattncagAACAA 

TGTTAGGATGGATACTCAGAAAACTACTTTGCCTCCAGCAGTTATTCGTCTATTACCTAAGAAGAATAC 

CTTTCGTCTCATTACGAA i i i AAGAAAAAGATTCTTAATAAAGgtanaamnggtcatcaatgiacrnacncraaiciattanag 

cagATGGGTTCAAACAAAAAAATGTTAGTCAGTACGAACCAAACTTTACGACCTGTGGCATCGATACTG 

AAACATTTAATCAATGAAGAAAGTAGTGGTATTCCATTTAACTTGGAGGTTTACATGAAGCTTCTTACT 

TTTAAGAAGGATCTTCTTAAGCACCGAATGTTTGGgtaattatataatgcgcgancctcartanaattngcagGCGTAAGAAG 

TATTTTGTACGGATAGATATAAAATCCTGTTATGATCGAATAAAGCAAGATTTGATGTTTCGGATTGT^ 

AAAAAGAAACTCAAGGATCCCGAATTTGTAATTCGAAAGTATGCAACCATACATGCAACAAGTGACCG 

AGCTACAAAAAAC i I i GTTAGTG AGGCG 1 I i I CCTATTgtaagntatTTTncanggaatmnaacaaanctTTTnagTTGATAT 

GGTGCCTTTTGAAAAAGTCGTGCAGTTACTTTCTATGAAAACATCAGATACTTTGTTTGTTGATT^ 

GGATTATTGGACCAAAAGTTCTTCTGAAATTTTTAAAATGCTCAAGGAACATCTCTGTGGACACATTGT 

TAAGgiataccaattgngaangtaataacactaatgaaactagATAGGAAATTCTCAATACCTTCAAAAAGTTGGTATCCCTC 

AGGGCTCAATTCTGTCATCTTTTTTGTGTCATTTCTATATGGAAGATTTGATTGATGA 

TACGAAAAAGAAAG GATCA GTGTTGTTACGAGTAGTCGACGATTTCCTCTTTATAACAGTTAATAAAA 

AGGATGCAAAAAAA I i I 1 I GAATTTATCTTTAAGAGgtgagngcigtcattcciaagnctaaccgttgaagGATTTGAGAA 

ACACAATTTTTCTACGAGCCTGGAGAAAACAGTAATAAACTTTGAAAATAGTAATGGGATAATAAACA 

ATAC riM iM'iTAATGAAAGCAAGAAAAGAATGCCATTCTTCGGTTTCTCTCTGAACATGAGGTCTCTTG 

ATACATTGTTAGC ATGTCCTA AAATTGATGAAGCCTTATTTAACTCTACATCTGTAGAGCTGACGAAAC 

ATATGGGGAAATCl I u i i i l ACAAAATTCTAAGgtatacigtgtaactgaataatagctgacaaaiaatcagATCGAGCCTTGC 

ATCCnTTTGCACAAGTATTTATTGACATTACCCACAATTCAAAATTCAATTCTTGCTGCAATA TATA TAG 

GCTAGGATACTCTATGTGTATGAGAGCACAAGCATACTTAAAAAGGATGAAGGATATATTTATTCCCC 

AAAGAATGTTCATAACGGgtgagtacttannaactagaaaagccattaattaaccttagATCTTTTCAATGTTATTGGAAGAAAA 

ATTTGGAAAAAGTTGGCCGAAATATTAGGATATACGAGTAGGCGTTTCTTGTCCTCTGCAGAAGTCAA 
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ATGetacetetcegtctcgagacucagcaaiattgacacatcagGCTTTTTTGTCTTGGAATGAGAGATGCTTTGAAACCCTCTT 

tcaLtatcatccatgcttcgaacagctaatataccaatttcagtcattgactgatcttatcaagccgc 

TAAGACCAGTTTTGCGACAGGTGTTATTTTTACATAGAAGAATAGCTGATTAAigtcanncaantanatatacatcctt 

tanactggtgicnaaacaautianacuagtatagctgacccccaaagcaagcatactataggatnctagtaaagtaaaanaatctcgnanagtmgangacngtcin 

atccttatacttnaagaaagangacagiggngctgactactgcccacatgcccanaaacgggagtggnaaacattaaaagtaatacatgaggctaatctccmcamag 

aataaggaaagtggmtctataatgaataaigcccgcaciaatgcaaaaagacgaaganatcnctaaacaaggggganaagcatatccgaaggaaaagagagtaatat 

acccagtgngttgaagaaagcaaggataamggaacaagcttctgcagatgacaggctaaamtggtgaccgaamtggtaaaagccccaggnatccatggtggccg 

gccngctactgagacgaaaagaaaciaaggatagmgaatactaatagctcaraaatgtcrtatataaggtmgtttmcctgacttcaatmgcatgggtgaaaagaaata 

gtgnaagccanangganccgaaatagccaaantcnggncctcaaagcggaagtctaaagaacnangaagcnaigaggcncaaaaactcctcctgamaaaggag 

gaatcttccaccgatgaggaaatggatagcnatcagctgctgaggagaagcctaatttmgcaaaaaagaaaatatcangggagacatcrcngatgaatcagatgcgga 

gagtaictccagcggatccngatgtcaataacnctamctgaaatgtatggtcctactgtcgcttcgacnctcgtagctctacgcagtiaagtgaccaaaggtacc 
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EST2 pep 

EXjplotea pep 

Trans of tecrahymen 

Consensus 



FrVCTEISST vnVYFRHDT Vlt^ KLIT ? FIVE YTK-TYLVEN 

FFYVTEQOKS Y^KTYYVKKN IWDVI-MKMS IAD LJCK ETLA--EVOE 

KHKE GSQXFYYKKP XWKJLVSKLTI \aCVRIQrSEK NTKQMKNNFYQ 

FFY,TE..:<. .i..YYYKK. IW...-KL F..K V. . 



40 
43 

50 



EST2 pep 

Euplotes pep 

Trans of cetrahymen 

Consensus 



NVCRNHNSY- TLSNFNHS 

j^EV£3V^KXSL GFA?G 

KIQLEEENLE KVEEKLIPED SFOKYPQG 

K. . .E F. .G 




FHIXMPCRG 



-- FRF 



FBFIylTFLRK 
EfaiHiOTF.RK 



79 
78 
92 

100 



EST2 pep 

Euploces pep 

Trans of cetrahymen 

Consensus 



ADEEHFTIYK ENHKbZAIQPTiVQKILE^ t rlNK 

DKQKNIK LJ^JI-^JQIIJ^^^S^LWK^£Ki^ 

.K, .K LN.N. ,L. .S--QL.L. .lEKN- 



RPTSFTKIYS FTQIADRIKE 129 

RMFK -DPFGFAVFM 12C 

>£L-G -QKIGYSVTD 130 

. . . . -IG. -YF'. 150 



EST2 pep 

Eupioces pep 

Trans of cecrahymen 

Consensus 



FKQRLU<Kf^ NVL "|^>l 

YD-DVMKKy^ EFVCKWKQVG CgE<§f 



r4HiF 



F, .KWK. .G .H. 



J^FMKFD VKSCYD 
'FATMD ZEKCYD 
VYVTL- 



3t 

fiCifF.T.D 



.CYC 



157 
155 
158 

186 
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S-1 ; EFY VTE TIP QPCN RLF FYR KSV WSK 
S-2: RQHLKRVQLRDVSEAEVRQHREA 
S-3; APITFRREKR AERLTSRVK ALPS VL NYE 

A-1 : AiCF LHW LMS VYV VEL LRS FFY VTE TTF Q 
A-2: LFF YRK SVW SKL QSI GIR QJ5L KRV QLR DVS 
A-3: PAL LTS RLR FIP KPD GLR PIV NMD YW 
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A 



Vector 



Genomic DNA Insert 



A5 



B2 



5825 bp Sequenced 
-2 kb Hind lit Fragment 



Vector 



1 kb 



B 



RT Motifs ) 2 3(A) 4(6*) 5(C) 6CD) 

f/ \ \ I ^ 



Introns 




$6 7 8 9 ion 




RT-PCR w/ M2-B14 
RT-PCR w/ M2-B15 Bdnd A 
RT-PCR w/ M2-B15 Bdnd B 
RT-PCR w/ M2-B16 Bdnd C 



500 bp 
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Poly 4 

t t c 

ta a g c c teg 
5'- cag acc aaa gga att cca taa gg -3' 
QTKGIPQG 



4(B') 



5 (C) 

D D Y L L I T 

3'- ccg ccg acg gag gag tag tgg -5' 
a a aaaaa a a 
t t: t t 
c c 
Poly 1 




Motif B' (4) 

QTKGIPQG 



Motif C (5) 
DDYLLIT 




PCR Product M2 showed Reasonable Match 
with Other Telomerase Proteins 

Ot I/CVSYILSSFYyANLEENAIiQFLRKESMDPEKPETNLLMRIjT 

Ea_pl23 KGIPQGI-CVSSILSSFYYATLEESSLGFLRIDESMNPENPNVNLI.MRLTDDYr-1-XT 

Sp_M2 SIIiSSFLCHFYMEDLXDEYLSFTKKK--- GSVI^LRW 

SC_J3103 DGLFQGSSLSAPXVDIiVYDDIiriEFYSEFKASPS ODTLILKLADDFLIXS 



Q K V G I P Q G 

caa aaa gen ggc azc ccz cag gg < Accuai Genomic Sequence. 

•^P^Y i 

p^;^ 3l g c c z c g 

j^rag _acc aaa gga acc cca caa gg > 

-;-;^g acc aaa gga acc cca cca ggC TCA ATT CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG 

;^CC egg ccc ccc caa ggc age ccG AGT TAA GAC AGT AGA AAA AAC ACA GTA AAG ATA TAG 



K 



M 



CAA GAT TTG ATT GAT GAA TAG CTA TCG TTT ACG AAA AAG AAA GGA- TCA GTG TTG TTA CGA 
CilT CTA AAC TAA CTA CTT ATG GAT AGC AAA TGC TTT TTC TTT CCT AGT CAC AAC AAT GCT 



K 



GTA GTC gac gac cac ccc ccc acc acc 
CAT CAG ccg ccg a eg gag gag cag egg 



V 



< ^J.g ^j.g g^g 

a a aaaaa a a 

c c c e 

c c 



gac gac ccc ccc ccc aca aca 
O D F L F I T 



< Accuai Genomic Sequence. 
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3' RT PGR Strategy 



mRNA 



5* 



I XAAAAAAAAA-AAXAAA 



1 Synthdsis o* cONA with Primef. 
mRNA 



IXAXAAAAXAAAAAAAA 

l yrin - i i i - i i i x i x i T 



2, First Round PCR Using Outsido Primer and Qo Primor. 



3. Second Round PCR Using msido Primer and Q| Primer. 



■ T - lTl - il X - 1 - 1 1 X X XT T 



4. sequence Second Round PCR Products Using Inside Primer or Qi Primer. 
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-Size Selected Libraries from P. Nurese 
3 - 4 kb 
5 - 6 kb 
7 - a kb 
1 1 - 12 kb 

-Libraries from J.A. Wise 
Sou 3a Partial Digest 
Hind III Partial Digest 

cDNA Libraries 

GAD (Gal Activation Donnain) Library 
REP Library fronn R. Allstiire 
REP81ES Library (old) 
REPS 1 ES Ubrar/ (new) 
REP41ES Library 



B 



JQ 
O 



P. Nurse 



r 



0> XT *0 CO t 
O t I i ^ 

c CO u) rs — 



< 

Q 

o 

Q 
< 

CD 



o O 



o 

jci n 



P •:§ _ _ 

CO CO <n 

CU tU UJ LU 

UJ f— — — 

C£ CO 00 "TT 



< 

Q 

O 

1 
O 
c 
o 
O 



Q 
O 



Original PGR 
3' RT-PCR 




500 bp 




-603 

^3]0 
.234 
' 194 




Hind III Digested Positive Genomic Clones 
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5* RT PGR Strategy 



mRNA 



I AAAAAAAAAAAAAXAA 



ddA- 



1 . Synthesis o( cDNA with Specific Downstream Primer. 



mRNA - 

5' ■HaM^HHMHHHMIHHMHHaMHM^MB^^HMMHMaHMHBM AAAAAAA^ 



2. Ugate Oiigo with 5'-P and blocked 3* to cDNA using R RNA Ugase. 




4. Second Round PGR 
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Alignment of RT Domains from Telomerase Catalytic Subunits. 

Motif O 

S.p. Tezlp (429) . WLYNSFIIPILQSFFYITESSDLRNRTv-^FRKDIW . . . (35) . . . 

S.c. Est2p (356). WLFRQLIPKIIQTFFYCTEISSTVT-rv'YFRHDTW ...(35)... 

E.a. pl2 3 (441). WIFEDLWSLIRCFFYVTEQQKSYSKTYYYRXNIW ...(35)... 

Motif 1 Motif 2 K 
p hh h K hR h R 

S.p. Tezlp AVTRLLPKK--NTFRLITN-LRKRF . . . (61) . . . 
S.c. Est2p SKMRIIPKKSNNEFRIIAIPCRGAD . . . (62) . . . 

E.a. pl23 GKLRL:PKK--TTFRPIMTFNKr:iV ...(61)... 

* ■*■ * ★ * * * 

Motif 3 (A) AF 

h hDh GY h 
S.p. Tezlp KKVFVRIDIKSCYDRIKQDLMFRIVK . . . (89) . . . 
S.c. Est2p ElVFKKFDVrlSCYDSIPFJyiECMRILK ...(75)... 

E.a. pl23 . :<1FFATMDIEKCYD3VNREKLSTFLK . . . (107) . . . 

* * ★ ★ * * 

Motif 4 (B' ) 

hPQG pP hh h 
S.p. Tezlp YLQKVGIPQGSILSSFLCHF^-MEDLIDEYLSF . . . (6) . . . 
S.c- Est2p YIREDGLFQGSSLSAPIVDL\r/DDLLEFYSEF . . . (8) . . . 
E.a. pl23 YKQTKGIPQGLCVS3ILSSF\'YATLEESSLGF ...(14)... 

Y MoLif 5(C) Mocif 6(D) 

h F DDhhh CA\ h cK h 

S.p. Tezlp V1LRVVDDFLFITVNKKD.^JCKFLNLSLRGFE:<HI\"STSLE . (205) 

S.c. Est2p lilkladdfliistix3QQvinikklamggfq:<yT':aka:jrdkilavssqs . ( 173 ) 

E.a. pi23 L1MRLTDDYLLITTQE^^^I.AVLFIEKLI^A/SRENG^:<F^^KLQTSFPLS . (209) 
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Figure 65 

Disruption strategy for the putative telomerase genes. 
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(These cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 
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An Example of Confirmation of tezl disruption By PCR 
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Tezl disruption causes progressive 
shortening of telomeres in S. pombe 
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Figure 68 



1 

met ser val tyr val val glu leu leu 
GCCAAGTTCCTGCACTGGCTG ATG AGT GTG TAG GTC GTC GAG CTG CTC 

10 20 
arg ser phe phe tyr val thr glu thr thr phe gin lys asn arg 
AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG 

30 

leu phe phe tyr arg lys ser val trp ser lys leu gin ser ile 
CTC TTT TTC TAG CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT 

40 50 . ■ 

gly ile arg gin his leu lys arg val gin leu arg glu leu ser 
GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 

60 

glu ala glu val arg gin his arg glu ala arg pro ala leu leu 
GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 

70 80 
thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro 
ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAG GGG CTG CGG CCG 

90 

ile val asn met asp tyr val val gly ala arg thr phe arg arg 
ATT GTG AAC ATG GAC TAG GTC GTG GGA GCC AGA ACG TTC CGC AGA 

100 110 

glu lys ala glu arg leu thr ser arg val lys ala leu phe 

GAA AAG ARG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 

120 

ser val leu asn tyr glu arg ala arg arg pro gly leu leu gly 
AGC GTG CTC AAC TAG GAG CGG GCG CGG CGC CCC GGC CTC CTG GGC 

130 140 

ala ser val leu gly leu asp asp ile his arg ala trp arg thr 
GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC ACC 

150 

phe val leu arg val arg ala gin asp pro pro pro glu leu tyr 
TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAG 

160 170 

phe val lys val asp val thr gly ala tyr asp thr ile pro gin 
TTT GTC AAG GTG GAT GTG ACG GGC GCG TAG GAC ACC ATC CCC CAG 



180 

asp arg leu chr glu val ile ala ser ile ile lys pro gin asn 
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 
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190 

thr tyr cys val arg 
ACG TAG TGC GTG CGT 



giy thr ser ala arg 
GGG ACG TGC GCA AGG 

220 

gin gly ile pro gin 
GAG GGG ATC CCG GAG 



leu cys tyr gly asp 
CTG TGC TAG GGC GAC 

250 

arg asp gly leu leu 
CGG GAC GGG CTG CTG 



thr pro his leu thr 
ACA CCT CAC CTC ACC 

28 0 

arg giy val pro glu 
CGA GGT GTC CCT GAG 



val val asn phe pro 

GTG GTG AAC TTC CCT 

310 

phe val gin met pro 

TTT GTT CAG ATG CCG 



leu leu asp thr arg 
CTG CTG GAT ACC CGG 

340 

tyr ala arg thr ser 
TAT GCC CGG ACC TGC 



phe lys ala gly arg 

TTC AAG GGT GGG AGG 

370 

arg leu lys cys his 

CGG CTG AAG TGT CAC 



arg tyr ala val val 

CGG TAT GCC GTG GTC 

210 

pro ser arg ala thr 

CCT TCA AGA GCC ACG 



gly ser ile leu ser 
GGC TGC ATC CTC TGC 

240 

met glu asn lys leu 
ATG GAG AAC AAG CTG 



leu arg leu val asp 
CTG CGT TTG GTG GAT 

270 ' 

his ala lys thr phe 
CAC GCG AAA ACC TTC 



tyr gly cys val val 
TAT GGC TGC GTG GTG 

300 

val glu asp glu ala 
GTA GAA GAC GAG GCC 



ala his gly leu phe 

GCC CAC GGC CTA TTC 

330 

thr leu glu val gin 

ACC CTG GAG GTG CAG 



ile arg ala ser leu 

ATC AGA GCC AGT CTC 

360 

asn met arg arg lys 

AAC ATG CGT CGC AAA 



ser leu phe leu asp 
AGC CTG TTT CTG GAT 



200 

gin lys ala ala met 
CAG AAG GCC GCC ATG 



ser tyr val gin cys 
TGC TAG GTC CAG TGC 

230 

thr leu leu cys ser 
ACG CTG CTC TGC AGC 



phe ala gly ile arg 

TTT GCG GGG ATT' CGG 

260 

asp phe leu leu val 

GAT TTC TTG TTG GTG 



leu arg thr leu val 

CTC AGG ACC CTG GTC 

290 

asn leu arg lys thr 

AAC TTG CGG AAG ACA 



leu gly gly thr ala 

CTG GGT GGC ACG GCT 

320 

pro trp cys gly leu 

CGC TGG TGC GGC CTG 



ser asp tyr ser ser 
AGC GAC TAG TCC AGC 

350 

thr phe asn arg gly 
ACC TTC AAC CGC GGC 



leu phe gly val leu 

CTC TTT GGG GTC TTG 

380 

leu gin val asn ser 

TTG CAG GTG AAC AGC 
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390 

leu gin thr val cys thr asn ile Cyr lys lie leu leu leu gin 
CTC CAG ACG GTG TGC ACC AAC ATC TAG AAG ATC CTC CTG CTG GAG 

400 410 

ala tyr arg phe his ala cys val leu gin leu pro phe his gin 
GCG TAG AGG TTT GAG GGA TGT GTG GTG CAG GTG GCA TTT CAT CAG 

420 

gin val trp lys asn pro his phe ser cys ala ser ser leu Chr 
CAA GTT TGG AAG AAC CCA CAT TTT TCC TGC GCG TCA TGT CTG AGA 

430 440 

arg leu pro leu leu leu his pro glu ser gin glu arg arg asp 
CGG CTC CCT GTG CTA CTC CAT CCT GAA AGC CAA GAA" CGC AGG .GAT 

450 

val ala gly gly gin gly arg arg arg pro ser ala leu arg gly 
GTC GCT GGG GGC CAA GGG CGC CGC CGG CCC TCT GCG CTC GGA GGC 

460 470 

arg ala val ala val pro pro ser ile pro ala gin ala asp ser 
CGT GCA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAG TGG 

480 

thr pro cys his leu arg ala thr pro gly val thr gin asp ser 
ACA CCG TGT GAG CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC 

490 500 

pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys 
CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAG GCT GAC TGC 

510 

pro gly gly arg ser gin pro gly thr ala leu arg leu gin asp 
CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520 530 

his pro gly leu met ala thr arg pro gin pro gly arg glu gin 
CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 

540 

thr pro ala ala leu ser arg arg ala tyr thr ser gin gly gly 
ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550 560 

arg gly gly pro his pro gly leu his arg trp glu ser glu ala 
AGG GGC GGC CCA GAC CCA GGC CTG GAC CGC TGG GAG TCT GAG GCC 

564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 
CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 
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AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 
CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 
TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 
AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 
GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 
TTT CAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Motif -1 

Ep pl23 ...LVVSLIRCFFYVTEQQKSYSKT... 

Sp Tezl ...FIIPILQSFFYITESSDLRNRT... 

Sc Est2 ...LIPKIIQTFFYCTEISSTVTIV... 

Hs TCP 1 ... YVVELLRSFFYVTETTFQKNRL... 
consensus FFY TE 

K 

Motif 0 phhh K hRh R 

Ep pl23 ...KSLGFAPGKLRLIPKKT-TFKPIMTFNKKIV.. 

Sp Tezl ...QKTTLPPAVIRLLPKKN--TFRLITNLRKRFL.. 

Sc Est2 ...TLSNFNHSKMRIIPKKSNNEFFUIAIPCRGAD. 

Hs TCPl ...ARPALLTSRLRFIPKPD-GLRPIVNMDYVVG 
consensus R PK R I 

AF 

Motif A h hDh GY h 

Ep pi23 ...PKXFFATMDIEKCYDSVNREKLSTFLK... 

Sp Tezl ...RKKYFVRIDIKSCYDRIKQDLMFRJVK... 

Sc Est2 ...PELYFMKFDVKSCYDSIPRMECMRILK... 

Hs TCPl ...PELYFVKVDVTGAYDTIPQDRLTEVIA...//... 
consensus F D YD 

Motif B hPQG pS hh 

Ep p 1 23 ...NGKFYKQTKGIPQGLCVSSILSSFYYA... 

Sp Tezl ...GNSQYLQKVGIPQGSILSSFLCHFYME... 

Sc Est2 ...EDKCYIREDGLFQGSSLSAPIVDLVYD... 

Hs TCPl ...RATSYVQCQGIPQGSILSTLLCSLCYG... 
consensus G QG S 

Y 

Motif C h FDDhhh 

Eppl23 ...PNVNLLMRLTDDYLLITTQENN... 

SpTezl ...KKGSVLLRVVDDFLFITVNKKD... 

Sc Est2 ...SQDTLILKLADDFLIISTDQQQ... 

Hs TCPl ...RRDGLLLRLVDDFLLVTPHLTH... 
consensus DD L 

Motif D Ghh cK 

Ep p 1 23 ...NVSRENGFKFNMKKL... 

SpTezl ...LNLSLRGFEKHNFST... 

Sc Est2 ...KKLAMGGFQKYNAICA... 

HsTCPl ...LRTLVRGVPEYGCVV... 
consensus G 
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1 GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC 
5 i CCGCGATGCC GCGCGCTCCC CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC 
1 0 1 AGCCACTACC GCGAGGTGCT GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG 
1 5 1 GCCCCAGGGC TGGCGGCTGG TGCAGCGCGG GGACCCGGCG GCTTTCCGCG 
201 CGNTGGTGGC CCANTGCNTG GTGTGCGTGC CCTGGGANGN ANGGCNGCCC 
251 CCCGCCGCCCCCTCCTTCCGCCAGGTGTCC TGCCTGAANG ANCTGGTGGC 
3 0 1 CCG AGTGCTG C ANANGCTGT GCGANCGCGG CGCG AANAAC GTGCTGGCCT 
351 TCGGCTTCGC GCTGCTGGAC GGGGCCCGCG GGGGCCCCCC CGAGGCCTTC 
401 ACCACCAGCG TGCGCAGCTA CCTGCCCAAC ACGGTGACCG ACGCACTGCG 
45 1 GGGGAGCGGG GCGTGGGGGC TGCTGCTGCG CCGCGTGGGC GACGACGTGC 
501 TGGTTC ACCT GCTGGCACGC TGCGCGNTNT TTGTGCTGGT GGNTCCCAGC 
551 TGCGCCTACC ANGTGTGCGG GCCGCCGCTG TACCAGCTCG GCGCTGCNAC 
60 1 TCAGGCCCGG CCCCCGCCAC ACGCTANTGG ACCCGAANGC GTCTGGGATC 
65 1 CAACGGGCCT GGAACCATAG CGTCAGGGAG GCCGGGGTCC CCCTGGGCTG 
70 1 CCAGCCCCGG GTGCGAGGAG GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC 
75 1 GTTGCCCAAG AGGCCCAGGC GTGGCGCTGC CCCTGAGCCG GAGCGGACGC 
80 1 CCGTTGGGCA GGGGTCCTGG GCCCACCCGG GCAGGACGCC TGGACCGAGT 
85 1 GACCGTGGTT TCTGTGTGGT GTCACCTGCC AGACCCGCCG AAGAAGCCAC 
90 1 CTCTTTGG AG GGTGCGCTCT CTGGC ACGCG CCACTCCCAC CCATCCGTGG 
95 1 GCCGCCAGCA CCACGCGGGC CCCCCATCCA CATCGCGGCC ACCACGTCCT 
100 1 GGGACACGCC TTGTCCCCCG GTGTACGCCG AGACCAAGCA CTTCCTCTAC 
105 1 TCCTCAGGCG ACAAGNACAC TGCGNCCCTC CTTCCTACTC AATATATCTG 
1101 AGGCCCAGGC TGACTGGCGT TCGGGAGGTT CGTGGAGACA NTCTTTCTGG 
1151 TTCCAGGCCT TGGATGCCAG GATTCCCCGC AGGTTGCCCC GCCTGCCCCA 
1201 GCGNTACTGG CAAATGCGGC CCCTGTTTCT GGAGCTGCTT GGGAACCACG 
1251 CGC AGTGCCC CTACGGGGTG TTCCTC AAGA CGCACTGCCC GCTGCGAGCT 
1301 GCGGTCACCC CAGCAGCCGG TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC 
1351 TGTGGCGGCC CCCGAGGAGG AGGAACACAG ACCCCCGTCG CCTGGTGCAG 
140 1 CTGCTCCGCC AGCACAGCAG CCCCTGGCAG GTGTACGGCT TCGTGCGGGC 
145 1 CTGCCTGCGC CGGCTGGTGC CCCCAGGCCT CTGGGGCTCC AGGCACAACG 
1501 AACGCCGCTT CCTCAGGAAC ACCAAGAAGT TC ATCTCCCT GGGGAAGCAT 
1551 GCC AAGCTCT CGCTGCAGGA GCTGACGTGG AAGATGAGCG TGCGGGACTG 
1 60 1 CGCTTGGCTG CGCAGGAGCC C AGGGGTTGG CTGTGTTCCG GCCGCAGAGC 
1 65 1 ACCGTCTGCG TGAGGAGATC CTGGCCAAGT TCCTGCACTG GCTGATGAGT 
1 70 1 GTGTACGTCG TCGAGCTGCT CAGGTCTTTC TTTTATGTCA CGGAGACC AC 
1 75 1 GTTTCAAAAG AAC AGGCTCT TTTTCTACCG GAAGAGTGTC TGGAGCAAGT 
1 80 1 TGCAAAGCAT TGGAATCAGA C AGC ACTTGA AGAGGGTGC A GCTGCGGGAG 
1 85 1 CTGTCGGAAG CAG AGGTCAG GCAGCATCGG GAAGCCAGGC CCGCCCTGCT 
1 90 1 G ACGTCC AG A CTCCGCTTC A TCCCC AAGCC TG ACGGGCTG CGGCCGATTG 
1 95 1 TG AAC ATGGA CTACGTCGTG GGAGCC AGAA CGTTCCGCAG AGAAAAGAGG 
200 1 GCCGAGCGTC TCACCTCGAG GGTGAAGGCA CTGTTCAGCG TGCTCAACTA 
205 1 CGAGCGGGCG CGGCGCCCCG GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG 
2101 ACGATATCC A CAGGGCCTGG CGC ACCTTCG TGCTGCGTGT GCGGGCCCAG 
2151 GACCCGCCGC CTGAGCTGTA CTTTGTCAAG GTGGATGTGA CGGGCGCGTA 
220 1 CGACACCATC CCCCAGGACA GGCTCACGGA GGTCATCGCC AGCATCATCA 
225 1 AACCCCAGAA CACGTACTGC GTGCGTCGGT ATGCCGTGGT CCAGAAGGCC 
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2301 GCCCATGGGC ACGTCCGCAA GGCCTTCAAG AGCCACGTCT CTACCTTGAC 
235 1 AGACCTCCAG CCGTACATGC GACAGTTCGT GGCTCACCTG CAGGANAACA 
240 1 GCCCGCTGAG GGATGCCGTC GTCATCG AGO AGAGCTCCTC CCTG AATGAG 
245 1 GCCAGGAGTG GCCTCTTCGA CGTCTTCCTA CGCTTCATGT GCCACCACGC 
250 1 CGTGCGCATC AGGGGC AAGT CCTACGTCCA GTGCCAGGGG ATCCCGCAGG 
255 1 GCTCCATCCT CTCCACGCTG CTCTGCAGCC TGTGCTACGG CGACATGGAG 
260 1 AACAAGCTGT TTGCGGGGAT TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT 
265 1 GGATGATTTC TTGTTGGTGA CACCTCACCT CACCCACGCG AAAACCTTCC 
270 1 TCAGG ACCCT GGTCCG AGGT GTCCCTG AGT ATGGCTGCGT GGTG AACTTG 
275 1 CGG AAGA CAG TGGTGAACTT CCCTGTAGAA GACGAGGCCC TGGGTGGCAC 
280 1 GGCTTTTGTT CAGATGCCGG CCCACGGCCT ATTCCCCTGG TGCGGCCTGC 
2851 TGCTGGATAC CCGGACCCTG GAGGTGCAGA GCGACTACTC CAGCTATGCC 
290 1 CGGACCTCCA TCAGAGCCAG TCTC ACCTTC AACCGCGGCT TCAAGGCTGG 
295 1 GAGGAACATG CGTCGCAAAC TCTTTGGGGT CTTGCGGCTG AAGTGTCACA 
300 1 GCCTGTTTCT GGATTTGCAG GTGAACAGCC TCCAGACGGT GTGC ACCAAC 
305 1 ATCTACAAGA TCCTCCTGCT GCAGGCGTAC AGGTTTCACG CATGTGTGCT 
3101 GCAGCTCCCA TTTCATCAGC AAGTTTGGA A GAACCCC ACA 1 111 1 CCTGC 
3151 GCGTCATCTC TGACACGGCC TCCCTCTGCT ACTCCATCCT GAAAGCCAAG 
3201 AACGCAGGGA TGTCGCTGGG GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC 
325 1 CGAGGCCGTG CAGTGGCTGT GCCACCAAGC ATTCCTGCTC AAGCTGACTC 
3301 GACACCGTGT CACCTACGTG CCACTCCTGG GGTCACTCAG GACAGCCCAG 
335 1 ACGCAGCTGA GTCGGAAGCT CCCGGGGACG ACGCTGACTG CCCTGGAGGC 
340 1 CGC AGCCAAC CCGGCACTGC CCTCAGACTT CAAGACCATC CTGGACTCAT 
345 i GGCCACCCGC CCACAGCCAG GCCGAGAGCA GACACCAGCA GCCCTGTGAC 
3501 GCCGGGCTCT ACGTCCCAGG GAGGGAGGGG CGGCCCACAC CCAGGCCCGC 
355 1 ACCGCTGGGA GTCTGAGGCC TGAGTGAGTG TTTGGCCGAG GCCTGCATGT 
3601 CCGGCTGAAG GCTGAGTGTC CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA 
365 1 GGGCTGAGTG TCCAGCACAC CTGCCGTCTT CACTTCCCCA CAGGCTGGCG 
370 1 CTCGGCTCCA CCCCAGGGCC AGCTTTTCCT CACC AGGAGC CCGGCTTCC A 
375 1 CTCCCCACAT AGGAATAGTC CATCCCCAGA TTCGCCATTG TTCACCCCTC 
3801 GCCCTGCCCTCCTTTGCCTTCCACCCCCACCATCCAGGTG GAGACCCTGA 
385 1 GAAGGACCCT GGGAGCTCTG GGAATTTGGA GTGACCAAAG GTGTGCCCTG 
3901 TACACAGGCG AGGACCCTGC ACCTGGATGG GGGTCCCTGT GGGTCAAATT 
3951 GGGGGGAGGT GCTGTGGGAG TAAAATACTG AATATATGAG 1 1 1 I ICAGTT 
4001 TTGAAAAAAA AAAAAAAAAA AAAAAAAAA 
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Figure 74 



met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

oro arc ala oro arg cys arg ala val arg ser leu leu arg ser 
hcG CG? GCT CCC CG? TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 3° 
his cyr arg crlu val leu pro leu ala thr phe val arg arg leu 
CAC TAG CGC aAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

cly oro gin gly crp arg leu val gin arg gly asp pro ala ala 
GGC CCC GAG GC-C TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCi 



60 



nhP ara ala le^^ val ala gin cys leu val cys val pro trp asp 
?TC Cci GCG Ct5 GTG GCC CAG TGC CTG GTG TC-C GTG CCC TGG GAC 

70 

^r-a nro oro ore ala ala pro ser phe arg gin val ser cys 
Gcl CCG clc CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

80 ?° 
leu ivs glu leu val ala arg val leu gin arg leu cys 
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

g^.v ala lys asn val leu ala phe gly phe ala leu leu asp gly 
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC G^oG 



120 



ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 



130 

cyr leu pro asn thr val chr asp ala leu arg gly ser gly ala 
TAG CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG 



Figure 74 (cont.) 



140 150 
trp glv leu leu leu arg arg val gly asp asp vei leu val his 
TGG GGG CTG CTG CTG CGC CGC GTG GGC GAC GAC GTG CTG GTT CAC 

160 

'leu leu ala arg cys ala leu phe val leu val ala pro ser cys 
CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG GTG GCT CCC AGC TGC 

170 130 
ala nyr gin val cys gly pro pro leu tyr gin leu gly ala ala 
GCC TAG CAG GTG TGC GGG CCG CCG CTG TAG CAG CTC GGC GCT GCC 

190 

zhr gin ala arg pro pro pro his ala ser gly pro arg arg arg 
ACT CAG GCC CC-G CCC CCG CCA CAC GCT AGT GGA CCC CGA AGG CGT 

200 210 
leu gly cys glu arg ala trp asn his ser val arg giu ala gly 
CTG GGA TGC GAA GGG GCC TGG AAC CAT AGC GTC AGG GAG GCC GGG 

220 

val pro leu gly leu pro ala pro gly ala arg arg arg gly gly 
GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG GGC 

250 240 
ser ala ser arg ser leu pro leu pro lys arg pro arg arg gly 
AGT GCG AGC CGA AGT CTG CCG TTG CCC AAG AGG GCC AGG CGT GGC 

250 

ala ala oro giu pro glu arg thr pro val gly gin gly ser -rp 
GCT GCC CCT GAG CCG GAG GGG ACG CCC GTT GGG CAG GGG TCC TGG 

250 270 
ala his pro gly arg chr arg gly pro ser asp arg gly phe cys 
GCC CAG GGG GGC AGG ACG CGT GGA CCG AGT GAC CGT GGT TTC TGT 

280 

val val ser pro ala arg pro ala glu glu ala chr ser leu giu 
GTG GTG TCA GCT GGC AGA GCG GCG GAA GAA GCG ACG TCT TTG GAG 
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giy ala leu 
GGT GCG CTC 



gin his his 
CAG CAC CAC 



iirp asp chr 
TOG GAG ACG 



leu t:yr ssr 
CTC TAC TCC 



leu ser ser 
CTC AGC TCT 



glu thr ile 
GAG ACC ATC 



arg arg leu 
CGC AGG TTG 



leu phe leu 
CTG TTT CTG 



val leu leu 
GTG CTC CTC 



ala ala gly 
GCA GCC GGT 



290 

ser gly thr 
TCT GGC ACG 



ala gly pro 
GCG GGC CCC 

320 

pro cys pro 
CCT TGT CCC 



ser gly asp 
TCA GGC GAC 

350 

leu arg pro 
CTG AGG CCC 



phe leu gly 
TTT CTG GGT 

380 

pro arg leu 
CCC CGC CTG 

glu leu leu 
GAG CTG CTT 

410 

lys chr his 
AAG ACG CAC 



val cys ala 
GTC TGT GCC 



arg his ser 
CGC CAC TCC 

310 

pre ser tihr 
CCA TCC ACA 



pro val tyr 
CCC GTG TAC 



lys glu g-ln 
AAG GAG CAG 



ser leu thr 
AGC CTG ACT 

370 

ser arg pro 
TCC AC-G CCC 

pre gin arg 
CCC CAG CGC 

400 

gly asn his 
GGC AAC CAC 

CVS pro leu 
TCC CCG CTG 

430 

arg glu lys 
CGG GAG AAG 



his pro ser 
CAC CCA TCC 



ser arg pro 
TCG CGG CCA 



ala glu thr 
GCC GAG ACC 



ieu arg pro 
CTG CGG CCC 



gly ala arg 
GGC GCT CGG 



trp met pro 
TGG ATG CCA 



tyr trp gin 
TAG T'GG CAA 



ala gin cys 
GCG CAG TGC 



arg ai.a aj.a 
CGA GCT GCG 



pro gin gly 
CCC CAG GGC 



300 

val gly arg 
GTG GGC CGC 



pro arg pro 
CCA CGT CCC 

330 
lys his phe 
Ai^.G CAC TTC 



ser phe leu 
TCC TTC CTA 

360 
arg leu val 
AGG CTC GTG 



gly thr pro 
GGG ACT CCC 

390 

mec arg pro 
ATG CGG CCC 



pro tyr gly 
CCC TAC GGG 

420 
val chr pro 
GTC ACC CCA 

ser val ala 
TCT GTG GCG 
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ala pre glu 
GCC CCC GAG 



leu arg gin. 
CTC CGC CAG 



ala cys leu 
GCC TGC CTG 



his asn glu 
CAC AAC GAA 



leu gly lys 
CTG GGG AAG 



mec ser vai 
ATG AGC GTG 



gly cys vai 
C-GC TGT GTT 



ala lys phe 
GCC AAG TTC 



leu arg ser 
CTC AGG TCT 



arg leu ohe 
AGG CTC TTT 



lie gly ile 
ATT GGA ATC 



440 

glu glu asp 
GAG GAG GAG 



his ser ser 
CAC AGC AGC 

470 

arg arg leu 
CGC CGG CTG 



arg arg phe 
CGC CGC TTC 

5Q0 

his ala lys 
CAT GCC AAG 



arg asp cys 
CGG GAG TGC 

530 

pro ala ala 
CCG GCC GCA 



leu his crp 
CTG CAC TGG 

560 

phe phe cyr 
TTC TTT TAT 



phe tyr arg 
TTC TAG CGG 

590 

arg gin his 
AGA CAG CAC 



thr asp pro 
ACA GAC CCC 

460 

pro trp gin 
CCC TGG CAG 



val pro pro 
GTG CCC CCA 

4 90 

leu arg asn 
CTC AGG AAC 



leu ser leu 
CTC TGG CTG 

520 

ala crp leu 
GCT TGG CTG 



glu his arg 
GAG CAC CGT 

550 

leu met ser 
CTG ATG AGT 



val chr glu 
GTC AGG GAG 

580 

pro ser val 
CCG AGT GTC 



leu lys arg 
TTG AAG AGG 



arg arc leu 
CGT CGC CTG 



val tiyr gly 
GTG TAC GGC 



gly leu crp 
GGC CTC TGG 



thr lys lys 
ACC AAG AAG 



gin glu leu 
CAG GAG CTG 



arg arg ser 
CGC AGG AGC 



leu arg giu 
CTG CGT GAG 



val cyr val 
GTG TAC GTC 



thr thr phe 
ACC ACG TTT 



trp ser lys 
TGG AGC AAG 



val gin leu 
GTG CAG CTG 



450 
val gin leu 
GTG CAG CTG 



phe val arg 
TTC GTG CGG 

480 
gly ser arg 
GGC TCC AGG 

phe ile ser 
TTC ATC TCC 

510 
chr crp lys 
ACG TOG AAG 



pro gly val 
CCA GGG GTT 

540 

giu ile leu 
GAG ATC CTG 



val glu leu 
GTC GAG CTG 

570 

gin lys asn 
CAA AAG AAC 



leu gin ser 
TTG CAA AGC 

600 

arg glu leu 
CGG GAG CTG 
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ser gi'- ala 
TCG G.^A GCA 



leu chr ser 
CTG ACG TCC 



pro ile val 
CCG ATT GTG 



arg glu iys 
AGA GA^. AAG 



phe ser val 
TTC AGC GTG 



gly ala ser 
C-GC C-CC TCT 



-hr phe val 
ACG TTC GTG 



tyr phe val 
TAG TTT GTC 



gin asp arg 
CAG GAG AGG 



asTx thr cyr 
AAC ACG TAG 



his giy his 
CAT GGG CAC 



glu val arg 
GAG GTC AGG 

620 

arg ieu arg 
AGA CTC CGC 



asn met asp 
AAC ATG GAC 

650 

arg ala glu 
AGG GCC GAG 



leu asn cyr 
CTC AAC TAG 

680 

val leu gly 
GTG CTG GGC 



ieu arg val 
CTG CGT GTG 

710 

lys val asp 
AAG GTG GAT 



leu rhr glu 
CTC ACG GAG 

740 

cys val arg 
TGC GTG CGT 



val arg iys 
GTC CGC AAG 



610 

gin liis arg 
CAG CAT GGG 



phe ile pro 
TTC ATG CGC 

640 

tyr vai val 
TAG GTC GTG 

arg leu chr 
CGT CTC ACC 

670 

glu arg ala 
GAG CGG GCG 



leu asp asp 
CTG GAC GAT 

700 

arg ala gin 
CGG GCC CAG 

val thr gly 
GTG ACQ GGC 

730 

val lie ala 
GTC ATC GCC 

arg tyr ala 
CGG TAT GCC 

760 

ala phe iys 
GCC TTC A.AG 



glu ala arg 
GAA GCC AGG 



lys pro asp 
AAG CCT GAC 



giy ala arg 
GGA GCC AGA 



ser arg val 
TCG AGG GTG 



arg arg pro 
CGG CGC CGC 



ile his arg 
ATC CAC AGG 



asp pro pro 
GAC CCG CCG 



ala tyr asp 
GCG TAG GAC 



ser ile ile 
AGC ATC ATC 



val val gin 
GTG GTC CAG 



ser his val 
AGC CAC GTC 



pre ala leu 
CCC GCC CTG 

630 
gly leu arg 
GGG CTG CGG 



thr phe arg 
ACG TTC CGC 

660 
lys ala leu 
.AAG GCA CTG 



gly leu leu 
GGC CTC CTG 

690 
ala trp arg 
GCC TGC- CGC 



pro glu leu 
CCT GAG CTG 

720 
thr ile pro 
ACC ATC CCC 



lys pro gin 
?^J\A CCC CAG 

750 
iys ala ala 
AAG GCC GCC 



ser thr leu 
TCT ACC TTG 
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chr asp ieu 
ACA GAC CTC 



giu thr ser 
GAG ACC AGC 



ser ieu asn 
TCC CTG .^T 



phe mec cys 
TTG ATG TGC 



gin cys cln 
GAG TGG CAG 



cys ser ^ leu 
TGC AGC ^CTG 



ile arg arg 
ATT CGG CGG 



leu vai chr 
TTG GTG ACA 



leu vai arg 



lys Chr val 
AAG ACA GTG 



Chr aia phe 
ACG GCT TTT 



770 

gin pro tiyr 
CAG CGG TAG 



pro leu arg 
CGG CTG AGG 

SOD 

glu ala ser 
GAG GCC AGG 



his his ala 
CAG CAG GGG 

S30 

gly ile pro 
GGG ATC CGG 



cys nyr gly 
TGC TAG GGG 

360 

asp gly leu 
GAG GGG CTG 



pro his leu 
CCT CAC CTC 

890 

gly val pro 
GGT GTC CCT 



val asn phe 
GTG AAC TTG 

920 

val gin met 
GTT CAG ATG 



met arg gin 
ATG CGA CAG 

790 

asp ala va.1 
GAT GCC GTC 



ser gly leu 
AGT GGC CTC 

820 

val arg ile 
GTG CGC ATC 



gin gly ser 
GAG GGC TCC 

850 

asp met glu 
GAC ATG GAG 



leu leu arg 
CTC CTG GGT 

880 

chr his ala 
ACC CAC GGG 



glu cyr gly 
GAG TAT GGC 

910 

pro val glu 
CCT GTA GAA 



pre ala his 
CGG GCC CAC 



phe val ala 
TTC GTG GCT 



val ile glu 
GTC ATC GAG 



phe asp val 
TTC GAC GTC 



arg gly lys 
AGG GGC ^XQ, 



ile leu ser 
ATC CTC TCC 



asn lys leu 
AAC AAG CTG 



lau val asp 
TTG GTG GAT 



lys chr phe 
AAA ACC TTC 



cys val val 
TGC GTG GTG 



asp glu ala 
GAC GAG GCC 



g ly 1 eu phe 
GGC CTA TTC 



780 

his leu gin 
CAC CTG CAG 



gin ser ser 
CAG AGC TCC 

810 

phe leu arg 
TTC CTA CGC 



ser tyr val 
TCC TAG GTC 

84 0 

chr leu leu 
ACG CTG CTC 



phe ala gly 
TTT GCG GGG 

370 

asp phe leu 
GAT TTC TTG 



leu arg chr 
CTC AGG ACC 

900 

asn leu arg 
A-a^C TTG CGG 



leu gly gly 
CTG GGT C-GC 

930 

pro trp cys 
CCC TGG TGC 
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gly leu ieu leu asp thr arg 
GGC CTG CTG CTG GAT ACC CC-G 

950 

ser ser zyr ala arg chr ser 
TCC AGC TAT GCC COG ACC TCC 



arg gly phe lys ala gly arg 
CGC GGC TTC AAG GCT GGG AGG 



v/al leu arg leu lys cys his 
GTC TTG CGG CTG AAG TGT CAC 



asn ser leu gin thr val cys 
AAC AGC CTC CAG ACG GTG TGC 

1010 

leu gin ala cyr arg phe his 
CTG CAG GCG TAC AGG TTT CAC 



his gin gin val crp lys asn 
CAT CAG CAA GTT TGG AAG AAC 

104 0 

ser asp zhr ala ser leu cys 
TCT GAC ACG GCC TCC CTC TGC 



940 

chr leu glu val gin ser asp cyr 
ACC CTG GAG GTG CAG AGC GAC TAC 

960 

ile arg ala ser val chr phe asn 
ATC AGA GCC AGT GTC ACC TTC AAC 

970 

asn mec arg arg lys leu phe gly 
AAC ATG CGT CGC A.AA CTC TTT GGG 

S90 

ser leu phe leu asp leu gin val 
AGC CTG TTT CTG GAT TTG CAG GTG 

1000 

chr asn ile cyr lys ile leu leu 
ACC A^C ATC TAC AAG ATC CTC CTG 

1020 

ala cys val leu gin leu pro phe 
GCA TGT GTC- CTG CAG CTC CCA TTT 

1030 

pro chr phe phe leu arg val ile 
CGC ACA TTT TTC CTG CGC GTC ATC 

1050 

cyr ser ile leu lys ala lys asn 
TAC TCC ATC CTG AAA GCC AAG AAC 



ala gly rriet ser leu gly ala lys 
GCA GGC- ATG TCG CTG GGG GCC AAG 

1070 

ser glu ala val gin crp leu cys 
TCC GAG GCC GTG CAG TGG CTG TGC 



ieu chr arg his arg val chr cyr 
CTG ACT CGA CAC CGT GTC ACC TAG 



gly ala ala gly pro leu pro 
GGC GCC GCC GGC CCT CTG CCC 

1080 

his gin ala phe leu leu lys 
CAC CAA GCA TTC CTG CTC AAG 

1090 

val pro leu leu gly ser leu 
GTG CCA CTC CTG GGG TCA CTC 
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arg chr ala gin thr gin leu ser arg Ivs leu ore giy thr thr 
AGG ACA GCC CAG ACG CAG CTG AGT CGG .^LAG CTC CCG GGG ACG ACG 

1120 

leu thr ala leu glu ala ala ala asn pre ala leu ore se- asD 
CTG AC- GCC CTG GAG GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC 

1130 1132 
phe lys chr ile leu asp O? 

TTC A.AG ACC ATC CTG GAC TGA TGGCCACCCGCCCACAGCCAGGCCGAGAGCAGA 

CACCAGCAGCCCTG7CACGCCGC-GCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCC 

AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC 

GGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAA.GGGCTGAGTGTC 

CAGCACACC-GCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAG 

CTTTTCYTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATT 

CGCCATTGTTCACCCYTCGCCCTGCCYTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA 

GACCCTGAGAAGGACCCTGGGAC-CTCTGGGAATTTC-GAGTGACCAAAGGTGTGCCCTGTA 

CACAGGCGAGGACCCTGCACCTGGATGGC-GGTCCCTGTGC-GTCV^.aT-GGGC-GGAGGTGC 

AAAAAAAA.V>L 
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